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PROTOPORPHYRINOGEN OXIQASE ("PROTOX") GENES 

The present invention relates to DNA molecules encoding protoporphyrinogen 
oxidase ("protox") enzymes and ttf DNA molecules encoding herbicide-tolerant forms of the 
enzyme protoporphyrinogen oxidase ("protox"). The invention further relates to herbicide- 
tolerant plants as well as methods for tissue culture selection and herbicide application 
based on these herbicide-tolerant forms of protox. 

I. The Protox Enzyme and its Involvement in the Chlorophyll/Heme Biosynthetic Pathway 

The biosynthetic pathways that lead to the production of chlorophyll and heme share 
a number of common steps. Chlorophyll is a light han/esting pigment present in all green 
photosynthetic organisms. Heme is a cofactor of hemoglobin, cytochromes, P450 mixed- 
function oxygenases, peroxidases, and catalyses {see, e.g. Lehninger, Biochemistry, Worth 
Publishers, New York (1 975)), and is therefore a necessary component for all aerobic 
organisms. 

The last common step in chlorophyll and heme biosynthesis is the oxidation of 
protoporphyrinogen IX to protoporphyrin IX. Protoporphyrinogen oxidase (refen^ed to herein 
as "protox") is the enzyme that catalyzes this last oxidation step (Matringe et al., Biochem. 
J. 260; 231 (1989)). 

The protox enzyme has been purified either partially or completely from a number of 
organisms Including the yeast Saccharomyces cerevisiae (Labbe-Bois and Labbe, In 
Blosynttiesis of Heme and Chlorophyll, E.H. Dailey, ed. McGraw Hill: New York, pp. 235-285 
(1990)). barley etioplasts (Jacobs and Jacobs, Biochem. J. 244:219 (1987)), and mouse 
liver (Dailey and Karr, Biochem. 26: 2697 (1987)). Genes encoding protox have been 
isolated from two prokaryotic organisms. Escherichia co// (Sasarman etal., Can. J. 
Microbiol. 39: 1 1 55 (1 993)) and Bacillus subtills (Dailey et ai, J. Biol. Chem. 269: 81 3 
(1994)). These genes share no sequence similarity; neither do their predicted protein 
products share any amino acid sequence identity. The E. co// protein is approximately 21 
kDa, and associates with the cell membrane. The B. subtills protein is 51 kDa, and is a 
soluble, cytoplasmic activity. 

Protox encoding genes have now also been isolated from humans (see Nishimura et 
al, J. Biol. Chem. 270(14): 8076-8080 (1995) and plants (International application no. 
PCT/IB95/00452 filed June 8, 1995, published Dec. 21, 1995 as WO 95/34659). 

II. The Protox Gene as a Herbicide Target 
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The use of herbicides to control undesirable vegetation such as weeds or plants in 
crops has become an almost universal practice. The relevant market exceeds a billion 
dollars annually. Despite this extensive use, weed control remains a significant and costly 
problem for farmers. 

Effective use of herbicides requires sound management. For instance, time and 
method of application and stage of weed plant development are critical to getting good 
weed control with herbicides. Since various weed species are resistant to herbicides, the 
production of effective herbicides becomes increasingly important. Novel herbicides can 
now be discovered using high-throughput screens that implement recombinant DMA 
technology. Metabolic enzymes essential to plant growth and development can be 
recombinantly produced though standard molecular biological techniques and utilized as 
herbicide targets in screens for novel inhibitors of the enzymes' activity. The novel inhibitors 
discovered through such screens may then be used as herbicides to control undesirable 
vegetation. 

Unfortunately, herbicides that exhibit greater potency, broader weed spectrum and 
more rapid degradation in soil can also have greater crop phytotoxicity. One solution 
applied to this problem has been to develop crops that are resistant or tolerant to 
herbicides. Crop hybrids or varieties resistant to the herbicides allow for the use of the 
herbicides without attendant risk of damage to the crop. Development of resistance can 
allow application of a heriDicide to a crop where its use was previously precluded or limited 
(e.g. to pre-emergence use) due to sensitivity of the crop to the hertDicide. For example, 
U.S. Patent No. 4,761 ,373, incorporated herein by reference, is directed to plants resistant 
to various imidazolinone or sulfonamide heri^icides. The resistance is conferred by an 
altered acetohydroxyacid synthase (AHAS) enzyme. U.S. Patent No. 4,975,374, 
incorporated herein by reference, relates to plant cells and plants containing a gene 
encoding a mutant glutamine synthetase (GS) resistant to inhibition by herbicides that were 
known to inhibit GS, e.g. phosphinothricin and methionine sulfoximine. U.S. Patent No. 
5,013,659, incorporated herein by reference, is directed to plants that express a mutant 
acetolactate synthase (ALS) that renders the plants resistant to inhibition by sulfonylurea 
herbicides. U.S. Patent No. 5,162,602, incorporated herein by reference, discloses plants 
tolerant to inhibition by cyclohexanedione and aryloxyphenoxypropanoic acid herbicides. 
The tolerance is conferred by an altered acetyl coenzyme A carboxylase(ACCase). U.S. 
Patent No. 5,554,798, incorporated herein by reference, discloses transgenic glyphosate 
resistant maize plants, which tolerance is confen^ed by an altered 5-enolpyruvyl-3- 
phosphoshiklmate (EPSP) synthase gene. 
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The protox enzyme serves as the target for a variety of herbicidal compounds. The 
herbicides that inhibit protox include many different structural classes of molecules (Duke et 
aL, Weed Sci. 39: 465 (1991); Nandihalli etaL, Pesticide Biocliem. Physiol. 43: 193 (1992); 
Matringe et al., FEBS Lett. 245: 35 (1989); Yanase and Andoh, Pesticide Biochem. Physiol. 
35: 70 (1989)). These herbicidal compounds include the diphenylethers {e.g. acifluorfen, 5- 
[2-chloro-4-(trif luoromethyl)phenoxy]-2-nitrobezoic acid; its methyl ester; or oxyfluorfen, 2- 
chloro-1-(3-ethoxy-4-nitrophenoxy)-4-(trifluoroben2ene)}, oxidiazoles, (e.g. oxidiazon, 3-[2,4- 
dichloro-5-(1 -methylethoxy)phenyl]-5-(1 ,1 -dimethylethyl)-1 ,3,4-oxadiazol-2-(3H)-one), cyclic 
imides (e.g. S-23142,A/-(4-chloro-2-fluoro-5-propargyloxyphenyl)-3,4,5,6- 
tetrahydrophthalimide; chlorophthalim. W-(4-chlorophenyl)-3.4,5,6-tetrahydrophthalimide), 
phenyl pyrazoles (e.g. TNPP-ethyl. ethyl 2-[1-(2.3.4-trichlorophenyl)-4-nitropyrazolyl-5- 
oxy]propionate; M&B 39279). pyridine derivatives (e.g. LS 82-556), and phenopylate and its 
Ophenylpyn-olidino- and piperidinocarbamate analogs. Many of these compounds 
competitively inhibit the normal reaction catalyzed by the enzyme, apparently acting as 
substrate analogs. 

Typically, the inhibitory effect on protox is determined by measuring fluorescence at 
about 622 to 635 nm, after excitation at about 395 to 410 nM (see, e.g. Jacobs and Jacobs. 
Enzyme 28: 206 (1 982); Sherman et al., Plant Physiol. 97: 280 (1 991 )). This assay is 
based on the fact that protoporphyrin IX is a fluorescent pigment, and protoporphyrinogen 
IX is nonfluorescent. 

The predicted mode of action of protox-inhibiting herbicides involves the 
accumulation of protoporphyrinogen IX in the chloroplast. This accumulation is thought to 
lead to leakage of protoporphyrinogen IX into the cytosol where it is oxidized by a 
peroxidase activity to protoporphyrin IX. When exposed to light, protoporphyrin IX can 
cause fomiation of singlet oxygen in the cytosol. This singlet oxygen can in tum lead to the 
formation of other reactive oxygen species, which can cause lipid peroxidation and 
membrane disruption leading to rapid cell death (Lee etal., Plant Physiol. 102. 881 (1993)). 

Not all protox enzymes are sensitive to herbicides that inhibit plant protox enzymes. 
Both of the protox enzymes encoded by genes isolated from Escherichia co// (Sasarman et 
al., Can. J. Microbiol. 39: 1 155 (1993)) and Bacillus subtilis (Dailey etal., J. Biol. Chem. 
269. 813 (1994)) are resistant to these herbicidal inhibitors. In addition, mutants of the 
unicellular alga Chlamydomonas relnhardtii resistant to the phenylimide herbicide S-231 42 
have been reported (Kataoka etal., J. Pesticide Sci. 75:449 (1990); Shibata etal., In 
Research in Photosynthesis, Vol. ill, N. Murata, ed. KluwenNethertands. pp. 567-570 
(1992)). At least one of these mutants appears to have an altered protox activity that is 
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resistant not only to the herbicidal inhibitor on which the mutant was selected, but also to 
other classes of protox inhibitors (Oshio et al, Z Naturforsch. 48c: 339 (1 993); Sato et aL, 
In ACS Symposium on Porphyric Pesticides, S. Duke, ed. ACS Press: Washington. D.C. 
(1994)). A mutant tobacco ceil line has also been reported that is resistant to the inhibitor 
S-21432 (Che ef a/.,. Z Naturforsch. 48c: 350 (1993). 

III. Plastid Transformation and Expression 

Plastid transfonmation, in which genes are inserted by homologous recombination 
into some or all of the several thousand copies of the circular plastid genome present in 
each plant cell, takes advantage of the enomious copy number advantage over nuclear- 
expressed genes to pemfiit expression levels that may exceed 10% of the total soluble plant 
protein. In addition, plastid transformation is desirable because in most plants plastid- 
encoded traits are not pollen transmissible; hence, potential risks of inadvertent transgene 
escape to wild relatives of transgenic plants is obviated. Plastid transformation technology is 
extensively described in U.S. Patent Nos. 5,451,513, 5,545,817, 5,545,818, and 5,576,198; 
in PCT Application Nos. WO 95/1 6783 and WO 97/32977; and in McBride et a!., Proc. Natl. 
Acad. Sci. USA 91: 7301-7305 (1994), all of which are incorporated herein by reference. 
Plastid transformation via biolistics was achieved initially in the unicellular green alga 
Chlamydomonas reinhardtil (BoynXon etal. (1988) Science 240: 1534-1537, incorporated 
herein by reference) and this approach, using selection for c/s-acting antibiotic resistance 
loci (spectinomycin/streptomycin resistance) or complementation of non-photosynthetic 
mutant phenotypes, was soon extended to Nicotiana tabacum (Svab etal. (1990) Proc. 
Nati. Acad. ScL USA 87: 8526-8530, incorporated herein by reference). 

The basic technique for tobacco chloroplast transformation involves the particle 
bombardment of leaf tissue or PEG-mediated uptake of plasmid DNA in protoplasts with 
regions of cloned plastid DNA flanking a selectable antibiotic resistance marker. The 1 to 
1.5 kb flanking regions, termed "targeting sequences," facilitate homologous recombination 
with the plastid genome and thus allow the replacement or modification of specific regions 
of the 156 kb tobacco plastid genome. Initially, point mutations in the chloroplast 16S rDNA 
and rps12 genes confening resistance to spectinomycin and/or streptomycin were utilized 
as selectable markers for transformation (Svab, Z., Hajdukiewicz, P., and Maliga, P. (1990) 
Proc. Natl. Acad. ScL USA 87. 8526-8530; Staub, J. M., and Maliga, P. (1992) Plant Ceil 4, 
39-45, incorporated herein by reference). This resulted in stable homoplasmic 
transformants at a frequency of approximately one per 100 bombardments of target leaves. 
The presence of cloning sites between these markers allowed creation of a plastid targeting 
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vector for introduction of foreign genes (Staub, J.M., and Maliga, P., EMBOJ. 12: 601-606 
(1993), incorporated herein by reference). Substantial increases in transformation 
frequency were obtained by replacement of the recessive rRNA or r-protein antibiotic 
resistance genes with a dominant selectable marker, the bacterial aadA gene encoding the 
spectinomycin-detoxifying enzyme aminoglycoside-3'-adenyltransf erase (Svab, Z., and 
Maliga, P. (1993) Proa Natl. Acad. ScL USA 90. 913-917, incorporated herein by 
reference). Previously, this marl<er had been used successfully for high-frequency 
transformation of the plastid genome of the green alga Chlamydomonas reinhardtii 
(Goldschmidt-Clennont, M. (1991) NucL Acids Res. 19. 4083-4089, incorporated herein by 
reference). Recently, plastid transformation of protoplasts from tobacco and the moss 
Physcomitrella patens has been attained using polyethylene glycol (PEG) mediated DNA 
uptake (O'Neill etal. (1993) Plant J. 3: 729-738; Koop etal. (1996) Planta 199: 193-201, 
both of which are incorporated herein by reference). 

The present invention provides DNA molecules isolated from maize and comprising 
the coding sequence of an enzyme having protoporphyrinogen oxidase (protox) activity. 
In particular, the coding sequence of a mature maize protoporphyrinogen oxidase is set 
forth in SEQ ID NO: 44, and the corresponding amino acid sequence for a mature maize 
protoporphyrinogen oxidase enzyme is set forth in SEQ ID NO:45. The present invention 
further provides chimeric genes comprising such DNA molecules. 

The present invention also provides DNA molecules isolated from wheat, soybean, 
cotton, sugar beet, oilseed rape, rice, sorghum, and sugar cane encoding enzymes having 
protoporphyrinogen oxidase (protox) activity and chimeric genes comprising such DNA. 
Sequences of such DNA molecules are set forth in SEQ ID NOs: 9 (wheat). 11 (soybean), 
15 (cotton), 17 (sugar beet), 19 (oilseed rape), 21 (rice). 23 (sorghum), and 36 (sugarcane). 

The present invention also provides modified forms of plant protoporphyrinogen 
oxidase (protox) enzymes that are resistant to compounds that inhibit unmodified naturally 
occurring plant protox enzymes, and DNA molecules coding for such inhibitor-resistant plant 
protox enzymes. Thus, in one aspect the present invention provides a DNA molecule 
encoding a plant protox enzyme that is capable of being incorporated into a DNA construct 
used to transform a plant containing wild-type, heriDicide-sensitive protox, wherein the DNA 
molecule has at least one point mutation relative to a wild-type DNA molecule encoding 
plant protox such that upon transfonmation with the DNA construct the plant contains the 
DNA molecule, which renders the plant resistant to the application of a herbicide that 
inhibits naturally occurring plant protox. The present invention includes chimeric genes and 
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modified forms of naturally occurring protox genes that can express the inhibitor-resistant 
plant protox enzymes in plants. 

Genes encoding inhibitor-resistant plant protox enzymes can be used to confer 
resistance to protox-inhibitory herbicides in whole plants and as a selectable marker in plant 
cell transformation methods. Accordingly, the present invention also includes plants, 
including the descendants thereof, plant tissues and plant seeds containing plant 
expressible genes encoding these modified protox enzymes. These plants, plant tissues 
and plant seeds are resistant to protox-inhibitors at levels that normally are inhibitory to the 
naturally occuning protox activity in the plant. Plants encompassed by the invention 
especially include those that would be potential targets for protox inhibiting herbicides, 
particularly agronomically important crops such as maize and other cereal crops such as 
barley, wheat, sorghum, rye, oats, turf and forage grasses, millet and rice. Also comprised 
are other crop plants such as sugar cane, soybean, cotton, sugar beet, oilseed rape and 
tobacco. 

The present invention accordingly provides a method for selecting plant cells 
transfomried with a DNA molecule of the invention that encodes a herbicide-tolerant form of 
plant protox. The method comprises introducing the DNA molecule into plant cells whose 
growth is sensitive to inhibition by herbicides to which the protox encoded by the DNA 
molecule is resistant, thus forming a transformed plant cell. The transformed plant cell 
whose growth is resistant to the selected herbicide is identified by selection at a herbicide 
concentration that inhibits the growth of untransformed plant cells. 

The present invention is directed further to methods for the production of plants, 
including plant material, such as for example plant tissues, protoplasts, cells, calli, organs, 
plant seeds, embryos, pollen, egg cells, zygotes, together with any other propagating 
material and plant parts, such as for example flowers, stems, fruits, leaves, roots originating 
in transgenic plants or their progeny previously transformed by means of the process of the 
invention, which produce an inhibitor-resistant form of the plant protox enzyme provided 
herein. Such plants may be stably transfonned with a structural gene encoding the 
resistant protox, or prepared by direct selection techniques whereby heri^icide resistant lines 
are isolated, characterized and developed. 

In another aspect, the present Invention is directed to a method for controlling 
unwanted vegetation growing at a locus where a hertDicide-tolerant, agronomically useful 
plant, which is transfomied with a DNA molecule according to the present invention that 
encodes a herbicide-tolerant fomn of plant protox, has been cultivated. The method 
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comprises applying to the locus to be protected an effective amount of herbicide that 
inhibits naturally occumng protox activity. 

The present invention is further directed to probes and methods for detecting the 
presence of genes encoding inhibitor-resistant forms of the plant protox enzyme and 
quantitating levels of inhibitor-resistant protox transcripts in plant tissue. These methods 
may be used to identify or screen for plants or plant tissue containing and/or expressing a 
gene encoding an inhibitor-resistant form of the plant protox enzyme. 

The present invention also relates to plastid transfomiation and to the expression of 
DNA molecules in a plant plastid. In a prefen^ed embodiment, a native plant protox enzyme 
or a modified plant protox enzyme is expressed in plant plastids to obtain herbicide resistant 
plants. 

In a further embodiment, the present invention is directed to a chimeric gene 
comprising: (a) a DNA molecule isolated from a plant, which in its native state encodes a 
polypeptide that comprises a plastid transit peptide, and a mature enzyme that is natively 
targeted to a plastid of the plant by the plastid transit peptide, wherein the DNA molecule is 
modified such that it does not encode a functional plastid transit peptide; and (b) a promoter 
capable of expressing the DNA molecule in a plastid, wherein the promoter is operatively 
linked to the DNA molecule. The DNA molecule may be modified in that at least a portion of 
the native plastid transit peptide coding sequence is absent from the DNA molecule. 
Alternatively, the DNA molecule may be modified in that one or more nucleotides of the 
native plastid transit peptide coding sequence are mutated, thereby rendering an encoded 
plastid transit peptide nonfunctional. The present invention also relates to plants 
homoplasmic for chloroplast genomes containing such chimeric genes. In a prefen-ed 
embodiment, the DNA molecule encodes an enzyme that is naturally inhibited by a 
herbicidal compound. In this case, such plants are resistant to a herbicide that naturally 
inhibits the enzyme encoded by a DNA molecule according to the present invention. 

The present invention is also directed to plants made resistant to a herbicide by 
transforming their plastid genome with a DNA molecule according to the present invention 
and to methods for obtaining such plants. In a preferred embodiment, the DNA molecule 
encodes an enzyme that is naturally inhibited by a herbicidal compound. In a more 
preferred embodiment, the DNA molecule encodes an enzyme having protoporphyrinogen 
oxidase (protox) activity, which is modified so that it that confers resistance to protox 
inhibitors. A further embodiment of the present invention is directed to a method for 
controlling the growth of undesired vegetation, which comprises applying to a population of 
the above-described plants an effective amount of an inhibitor of the enzyme. 
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The present invention also provides a novel method for selecting a transplastomic 
plant cell, comprising the steps of: introducing the above-described chimeric gene into the 
plastome of a plant cell; expressing the encoded enzyme in the plastids of the plant cell; 
and selecting a cell that is resistant to a herbicidal compound that naturally inhibits the 
activity of the enzyme, whereby the resistant cell comprises transformed plastids. In a 
prefenred embodiment, the enzyme is naturally inhibited by a herbicidal compound and the 
transgenic plant is able to grow on an amount of the hert^icidal compound that naturally 
inhibits the activity of the enzyme. In a further prefen-ed embodiment, the enzyme has 
protoporphyrinogen oxidase (protox) activity and is modified so that it that confers 
resistance to protox inhibitors. 

Brief description of the Sequence Listing: 

SEQ ID N0:1 : DNA coding sequence for an Arabidopsis thaliana protox-1 protein. 

SEQ ID N0:2: Arabidopsis protox-1 amino acid sequence encoded by SEQ ID NO:1. 

SEQ ID N0:3: DNA coding sequence for an Arabidopsis thaliana protox-2 protein. 

SEQ ID N0:4: Arabidopsis protox-2 amino acid sequence encoded by SEQ ID N0:3. 

SEQ ID N0:5: Partial DNA coding sequence for a maize protox-1 protein. 

SEQ ID N0:6: Partial maize protox-1 amino acid sequence encoded by SEQ ID NO:5. 

SEQ ID N0:7: DNA coding sequence for a maize protox-2 protein. 

SEQ ID N0:8: Maize protox-2 amino acid sequence encoded by SEQ ID N0:7. 

SEQ ID N0:9: Partial DNA coding sequence for a wheat protox-1 protein. 

SEQ ID N0:10: Partial wheat protox-1 amino acid sequence encoded by SEQ ID N0:9. 

SEQ ID N0:1 1 : DNA coding sequence for a soybean protox-1 protein. 

SEQ ID N0:12: Soybean protox-1 protein encoded by SEQ ID NO:1 1 . 

SEQ ID N0:13: Promoter sequence from Arabidopsis thaiiana protox-1 gene. 

SEQ ID N0:14: Promoter sequence from maize protox-1 gene. 

SEQ ID N0:15: DNA coding sequence for a cotton protox-1 protein. 

SEQ ID N0:16: Cotton protox-1 amino acid sequence encoded by SEQ ID NO:15. 

SEQ ID N0:17: DNA coding sequence for a sugar beet protox-1 protein. 

SEQ ID N0:18: Sugar beet protox-1 amino acid sequence encoded by SEQ ID NO:17. 

SEQ ID N0:19: DNA coding sequence for an oilseed rape protox-1 protein. 

SEQ ID N0:20: Oilseed rape protox-1 amino acid sequence encoded by SEQ ID N0:19. 

SEQ ID N0:21: Partial DNA coding sequence for a rice protox-1 protein. 

SEQ ID N0:22: Partial rice protox-1 amino acid sequence encoded by SEQ ID N0:21. 

SEQ ID N0:23: Partial DNA coding sequence for a sorghum protox-1 protein. 
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SEQ ID N0:24: Partial sorghum protox-1 amino acid sequence encoded by SEQ ID NO:23. 

SEQ ID NO:25: Maize protox-1 intron sequence. 

SEQ ID NO:26: Promoter sequence from sugar beet protox-1 gene. 

SEQ ID NO:27: Pclp„P1a - plastid c/pP gene promoter top strand PGR primer. 

SEQ ID NO:28: Pclp_P1b - plastid cIpP gene promoter bottom strand PGR primer. 

SEQ ID NO:29: Pclp_P2b - plastid c/pP gene promoter bottom strand PGR primer. 

SEQ ID NO:30: Trps16_P1a - plastid rps16 gene top strand PGR primer. 

SEQ ID N0:31 : Trps16_p1b - plastid rpsi 6 gene bottom strand PGR primer. 

SEQ ID N0:32: minpsb_U - plastid psbA gene top strand primer. 

SEQ ID NO:33: minpsb_L - plastid psM gene bottom strand primer. 

SEQ ID N0:34: APRTXPIa - top strand PGR primer. 

SEQ ID NO:35: APRTXPIb - bottom strand PGR primer. 

SEQ ID NO:36: Partial DNA coding sequence for a sugar cane protox-1 protein. 

SEQ ID NO:37: Partial sugar cane protox-1 amino acid sequence encoded by SEQ ID NO:36. 

SEQ ID NO:38: Sub-sequence #1 (APAiF). 

SEQ ID NO:39: Sub-sequence #8 (YIGGAe). 

SEQ ID N0:40: Sub-sequence #12 (IGGA12). 

SEQ ID N0:41 : Sub-sequence #13 (SWXLA13). 

SEQ ID NO:42: Sub-sequence #15 (GAisXGL). 

SEQ ID N0:43: Sub-sequence #17 (YVA17G). 

SEQ ID NO:44: DNA coding sequence for a mature maize protox-1 protein. 
SEQ ID NO:45: Mature maize protox-1 amino acid sequence encoded by SEQ ID NO:44. 
SEQ ID NO:46: Predicted coding sequence for a maize protox-1 polypeptide precursor. 
SEQ ID N0:47: Predicted amino acid sequence of a maize protox-1 polypeptide precursor 
encoded by SEQ ID NO:46. 

The following vector molecules have been deposited with Agricultural Research 
Service, Patent Gulture Gollection (NRRL), Northern Regional Research Center, 1815 North 
University Street, Peoria, Illinois 61604, U.S.A on the dates indicated below: 

Wheat protox-1 a, in the pBluescript SK vector, was deposited March 19, 1996, as 
pWDG-13 (NRRL #621545). 

Soybean protox-1, in the pBluescript SK vector, was deposited December 15, 1995 
as pWDG-12 (NRRL #8-21516). 
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Cotton protox-1 , in the pBluescript SK vector, was deposited July 1 , 1 996 as pWDC- 
15(NRRL#B-21594). 

Sugar beet protox-1, in the pBluescript SK vector, was deposited July 29, 1996, as 
pWDC-1 6 (NRRL #B-21 595N). 

Oilseed rape protox-1 , in the pBluescript SK vector, was deposited August 23, 1 996, 
as pWDC-17 (NRRL #B-21615). 

Rice protox-1 , in the pBluescript SK vector, was deposited December 6, 1996, as 
pWDC-18 (NRRL #B-21648). 

Sorghum protox-1 , in the pBluescript SK vector, was deposited December 6, 1996, 
as pWDC-19 (NRRL #B-21649). 

Resistant mutant pAraC-2Cys, in the pMut-1 plasmid, was deposited on November 
14, 1994 under the designation pWDC-7 with the Agricultural Research Culture Collection 
and given the deposit designation NRRL #21339N. 

AraPTI Pro containing the Arabidopsis protox-1 promoter was deposited December 
15, 1995, as pWDC-11 (NRRL#B-21515) 

A plasmid containing the maize protox-1 promoter fused to the remainder of the 
maize protox-1 coding sequence was deposited March 19, 1996 as pWDC-14 (NRRL #B- 
21546). 

A plasmid containing the sugar beet protox-1 promoter was deposited December 6. 
1996, as pWDC-20 (NRRL #B-21650). 

For clarity, certain terms used in the specification are defined and presented as 

follows: 

Associated With / Operatively Linked: refers to two DNA sequences that are related 
physically or functionally. For example, a promoter or regulatory DNA sequence is said to 
be "associated with" a DNA sequence that codes for an RNA or a protein if the two 
sequences are operatively linked, or situated such that the regulator DNA sequence will 
affect the expression level of the coding or stmctural DNA sequence. 

Chimeric Gene: a recombinant DNA sequence in which a promoter or regulatory 
DNA sequence is operatively linked to, or associated with, a DNA sequence that codes for 
an mRNA or which is expressed as a protein, such that the regulator DNA sequence is able 
to regulate transcription or expression of the associated DNA sequence. The regulator 
DNA sequence of the chimeric gene is not nomially operatively linked to the associated 
DNA sequence as found in nature. 
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Coding DNA Sequence: a DNA sequence that is translated in an organism to 
produce a protein. 

Corresponding To: in the context of the present invention, "corresponding to" means 
that when the amino acid sequences of various protox enzymes are aligned with each 
other, such as fn Table 1 A, the amino acids that "correspond to" certain enumerated 
positions in Table 1 A are those that align with these positions in Table 1 A, but that are not 
necessarily in these exact numerical positions relative to the particular protox enzyme's 
amino acid sequence. Likewise, when the amino acid sequence of a particular protox 
enzyme (for example, the soybean protox enzyme) is aligned with the amino acid sequence 
of a reference protox enzyme (for example, the Arabidopsis protox-1 sequence given in 
SEQ ID N0:2), the amino acids in the soybean protox sequence that "correspond to" certain 
enumerated positions of SEQ ID NO:2 are those that align with these positions of SEQ ID 
N0:2, but are not necessarily in these exact numerical positions of the soybean protox 
enzyme's amino acid sequence. 

HeriDicide: a chemical substance used to kill or suppress the growth of plants, plant 
cells, plant seeds, or plant tissues. 

Heterologous DNA Sequence: a DNA sequence not naturally associated with a host 
cell into which it is introduced, including non-naturally occurring multiple copies of a 
naturally occurring DNA sequence. 

Homologous DNA Sequence: a DNA sequence naturally associated with a host cell 
into which it is introduced. 

Homoplasmic: refers to a plant, plant tissue or plant cell, wherein all of the plastids 
are genetically identical. In different tissues or stages of development, the plastids may take 
different fomis, e.g., chloroplasts, proplastids, etioplasts, amyloplasts, chromoplasts, and so 
forth. 

Inhibitor: a chemical substance that inactivates the enzymatic activity of a protein 
such as a blosynthetic enzyme, receptor, signal transduction protein, stmctural gene 
product, or transport protein that is essential to the growth or sun^ival of the plant. In the 
context of the instant invention, an inhibitor is a chemical substance that inactivates the 
enzymatic activity of protox. The temri "heri^icide" is used herein to define an inhibitor when 
applied to plants, plant cells, plant seeds, or plant tissues. 

Isolated: in the context of the present invention, an isolated nucleic acid molecule or 
an isolated enzyme is a nucleic acid molecule or enzyme that, by the hand of man, exists 
apart from its native environment and is therefore not a product of nature. An isolated 
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nucleic acid molecule or enzyme may exist in a purified fonn or may exist in a non-native 
environment such as, for example, a transgenic host cell. 

Minimal Promoter promoter elements, particularly a TATA element, that are inactive 
or that have greatly reduced promoter activity in the absence of upstream activation. In the 
presence of a suitable transcrption factor, the minimal promoter functions to permit 
transcription. 

Modified Enzyme Activity: enzyme activity different from that which naturally occurs 
in a plant (i.e. enzyme activity that occurs naturally in the absence of direct or indirect 
manipulation of such activity by man), which is tolerant to inhibitors that inhibit the naturally 
occum'ng enzyme activity. 

Nucleic Acid Molecule: a linear segment of single- or double-stranded DNA or RNA 
that can be isolated from any source. In the context of the present invention, the nucleic 
acid molecule is preferably a segment of DNA. 

Plant: refers to any plant or part of a plant at any stage of development. Therein are 
also included cuttings, cell or tissue cultures and seeds. As used in conjunction with the 
present invention, the term "plant tissue" includes, but is not limited to, whole plants, plant 
cells, plant organs, plant seeds, protoplasts, callus, cell cultures, and any groups of plant 
cells organized into structural and/or functional units. 

Plastome: the genome of a plastid. 

Protox-1 : chloroplast protox. 

Protox-2: mitochondrial protox. 

Significant increase: an increase in enzymatic activity that is larger than the margin 
of en-or inherent in the measurement technique, preferably an increase by about 2-fold or 
greater of the activity of the wild-type enzyme in the presence of the inhibitor, more 
preferably an increase by about 5-fold or greater, and most preferably an Increase by about 
10-fold or greater. 

Substantially Similar: with respect to nucleic acids, a nucleic acid molecule that has 
at least 60 percent sequence identity with a reference nucleic acid molecule. In a preferred 
embodiment, a substantially similar DNA sequence is at least 80% Identical to a reference 
DNA sequence; in a more preferred embodiment, a substantially similar DNA sequence is at 
least 90% identical to a reference DNA sequence; and in a most preferred embodiment, a 
substantially similar DNA sequence is at least 95% identical to a reference DNA sequence. 
A substantially similar nucleotide sequence typically hybridizes to a reference nucleic acid 
molecule, or fragments thereof, under the following conditions: hybridization at 7% sodium 
dodecyl sulfate (SDS), 0.6 M NaP04 pH 7.0, 1 mM EDTA at 50*C; wash with 2X SSC, 1% 
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SDS, at 50°C. With respect to proteins or peptides, a substantially similar amino acid 
sequence is an amino acid sequence that Is at least 90% identical to the amino acid 
sequence of a reference protein or peptide and has substantially the same activity as the 
reference protein or peptide. 

Tolerance / Resistance: the ability to continue nonnal growth or function when 
exposed to an inhibitor or herbicide. 

Transformation: a process for introducing heterologous DNA into a cell, tissue, or 
plant. Transf omied cells, tissues, or plants are understood to encompass not only the end 
product of a transfonfnation process, but also transgenic progeny thereof. 

Transit Peptide: a signal polypeptide that is translated in conjunction with a protein 
encoded by a DNA molecule, fomiing a polypeptide precursor. In the process of transport to 
a selected site within the cell, a chloroplast for example, the transit peptide can be cleaved 
from the remainder of the polypeptide precursor to provide an active or mature protein. 

Transfonned: refers to an organism such as a plant into which a heterologous DNA 
molecule has been introduced. The DNA molecule can be stably integrated into the genome 
of the plant, wherein the genome of the plant encompasses the nuclear genome, the plastid 
genome and the mitochondrial genome. In a transfonmed plant, the DNA molecule can also 
be present as an extrachromosomal molecule. Such an extrachromosomal molecule can be 
auto-replicating. A "non-transfornied" plant refers to a wild-type organism, i.e., a plant, 
which does not contain the heterologous DNA molecule. 

Transplastome: a transfonned plastid genome. 

Nucleotides are indicated by their bases by the following standard abbreviations: 
adenine (A), cytosine (C). thymine (T), and guanine (G). Amino acids are likewise indicated 
by the following standard abbreviations: alanine (ala; A), arginine (Arg; R), asparagine (Asn; 
N) , aspartic acid (Asp; D), cysteine (Cys; C), glutamine (Gin; Q), glutamic acid (Glu; E), 
glycine (Gly; G), histidine (His; H), isoleucine (He; I), leucine (Leu; L), lysine (lys; K), 
methionine (Met; M), phenylalanine (Phe; F). proline (Pro; P). serine (Ser; S), threonine 
(Thr; T), tryptophan (Trp; W). tyrosine (Tyr; Y), and valine (Val; V). Furthermore (Xaa; X) 
represents any amino acid. 

I. Plant Protox Coding Sequences ' .. 

In one aspect, the inventors of the present invention have detennlned for the first 
time the NH2 terminus of the mature maize protoporphyrin oxidase. To achieve this result, 
maize etioplasts were isolated and the protoporphyrin oxidase was immunoprecipitated from 
the etioplast extract using anti-protoporphyrin oxidase antibodies covalently bound to 
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Protein A Sepharose resin. The NH2 terminus of the mature maize protoporphyrin oxidase 
was determined by microsequencing and found to be located at the alanine at position 36 in 
the amino acid sequence set forth in SEQ ID NO:47. This is an unexpected and surprising 
result since using the software program ChloroP (Nielsen, H., Engelbrecht, J., Bainak, S., & 
von Heijne. G. (1997) Prot Engineer. 10, 1-6), the NH2 terminus of the mature protein was 
predicted at the amino acid corresponding to the cystein at position 35 in the amino acid 
sequence set forth in SEQ ID NO:47. The present invention therefore discloses a novel and 
non-obvious DNA molecule comprising the mature coding sequence of the maize 
protoporphyrin oxidase (protox). The nucleotide sequence of the coding sequence of the 
mature maize protoporphyrin oxidase (protox) is provided in SEQ ID NO:44 and the 
corresponding amino acid sequence is set forth in SEQ ID NO:45. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes protoporphyrlnogen oxidase (referred to herein as "protox"), the enzyme that 
catalyzes the oxidation of protoporphyrlnogen IX to protoporphyrin IX, from wheat, soybean, 
cotton, sugar beet, oilseed rape, rice, sorghum, and sugar cane. The partial DNA coding 
sequence and corresponding amino acid sequence for a wheat protox enzyme are provided 
as SEQ ID NOs:9 and 10, respectively. The DNA coding sequence and corresponding 
amino acid sequence for a soybean protox enzyme are provided as SEQ ID NOs:1 1 and 
12, respectively. The DNA coding sequence and corresponding amino acid sequence for a 
cotton protox enzyme are provided as SEQ ID N0s:15 and 16, respectively. The DNA 
coding sequence and corresponding amino acid sequence for a sugar beet protox enzyme 
are provided as SEQ ID NOs:17 and 18, respectively. The DNA coding sequence and 
corresponding amino acid sequence for an oilseed rape protox enzyme are provided as 
SEQ ID N0s:19 and 20, respectively. The partial DNA coding sequence and con-esponding 
amino acid sequence for a rice protox enzyme are provided as SEQ ID N0s:21 and 22, 
respectively. The partial DNA coding sequence and con-esponding amino acid sequence 
for a sorghum protox enzyme are provided as SEQ ID NOs:23 and 24, respectively. The 
partial DNA coding sequence and con-esponding amino acid sequence for a sugar cane 
protox enzyme are provided as SEQ ID NOs:36 and 37, respectively. 

The DNA coding sequences and corresponding amino acid sequences for protox 
enzymes from Arabidopsis thaliana and maize are provided herein as SEQ ID N0s:1-4 
{Arabidopsis) and SEQ ID NOs:5-8 (maize). 

The invention therefore is directed to a DNA molecule encoding a 
protoporphyrlnogen oxidase (protox) comprising a eukaryotic protox selected from the 
group consisting of a wheat protox enzyme, a soybean protox enzyme, a cotton protox 
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enzyme, a sugar beet protox enzyme, an oilseed rape protox enzyme, a rice protox 
enzyme, a sorghum protox enzyme, and a sugar cane protox enzyme. 

Prefenred within the scope of the invention are isolated DNA molecules encoding the 
protoporphyrinogen oxidase (protox) enzyme from dicotyledonous plants, but especially 
from soybean plants, cotton plants, sugar beet plants and oilseed rape plants, such as 
those given in SEQ ID NOS: 11, 15, 17 and 19. More preferred are isolated DNA molecules 
encoding the protoporphyrinogen oxidase (protox) enzyme from soybean, such as given in 
SEQ ID N0:1 1 , and sugar beet, such as given In SEQ ID NO:17. 

Also preferred are isolated DNA molecules encoding the protoporphyrinogen 
oxidase (protox) enzyme from monocotyledonous plants, but especially from wheat plants, 
rice plants, sorghum plants, and sugar cane plants, such as those given in SEQ ID NOS: 9, 
21, 23, and 36. More preferred are isolated DNA molecules encoding the 
protoporphyrinogen oxidase (protox) enzyme from wheat such as given in SEQ ID N0:9. 

In another aspect, the present invention is directed to isolated DNA molecules 
encoding the protoporphyrinogen oxidase (protox) enzyme protein from a dicotyledonous 
plant, wherein the protein comprises the amino acid sequence selected from the group 
consisting of SEQ ID NOs: 12, 16, 18 and 20. Further comprised are isolated DNA 
molecules encoding the protoporphyrinogen oxidase (protox) enzyme protein from a 
monocotyledonous plant, wherein the protein comprises the amino acid sequence selected 
from the group consisting of SEQ ID NOs: 10, 22, 24, and 37. More preferred is an isolated 
DNA molecule encoding the protoporphyrinogen oxidase (protox) enzyme wherein the 
protein comprises the amino acid sequence from wheat such as given in SEQ ID NO:10. 
More prefenred is an isolated DNA molecule encoding the protoporphyrinogen oxidase 
(protox) enzyme wherein the protein comprises the amino acid sequence from soybean, 
such as given in SEQ ID NO:12 and sugar beet, such as given in SEQ ID N0:18. 

Using the information provided by the present invention, the DNA coding sequence 
for the protoporphyrinogen oxidase (protox) enzyme from any eul<aryotic organism may be 
obtained using standard methods. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a wheat protox enzyme and that comprises a nucleotide sequence that hybridizes 
to the coding sequence shown in SEQ ID N0:9 under the following hybridization and wash 
conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 
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In yet another aspect, the present invention is directed to an isolated DNA molecule 
that encodes a soybean protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:1 1 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at So"* C; and 

(b) wash in 2X SSC. 1 % SDS at 50° C. 

In still another aspect, the present Invention is directed to an isolated DNA molecule 
that encodes a cotton protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID N0:15 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0. 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1% SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a sugar beet protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID N0:17 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0. 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes an oilseed rape protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID N0:19 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C;and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a rice protox enzyme and that comprises a nucleotide sequence that hybridizes to 
the coding sequence shown in SEQ ID NO:21 under the following hybridization and wash 
conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC. 1 % SDS at 50° C. 
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In another aspect, the present invention is directed to an Isolated DNA molecule that 
encodes a sorghum protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:23 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM 
EDTA at so"" C; and 

(b) wash in 2X SSC, 1 % SDS at 50** C. 

In another aspect, the present invention is directed to an isolated DNA molecule that 
encodes a sugar cane protox enzyme and that comprises a nucleotide sequence that 
hybridizes to the coding sequence shown in SEQ ID NO:36 under the following 
hybridization and wash conditions: 

(a) hybridization in 7% sodium dodecyl sulfate (SDS), 0,5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C; and 

(b) wash in 2X SSC, 1 % SDS at 50° C. 

The isolated eukaryotic protox sequences taught by the present invention may be 
manipulated according to standard genetic engineering techniques to suit any desired 
purpose. For example, the entire protox sequence or portions thereof may be used as 
probes capable of specifically hybridizing to protox coding sequences and messenger 
RNA's. To achieve specific hybridization under a variety of conditions, such probes include 
sequences that are unique among protox coding sequences and are preferably at least 10 
nucleotides in length, and most preferably at least 20 nucleotides in length. Such probes 
may be used to amplify and analyze protox coding sequences from a chosen organism via 
the well known process of polymerase chain reaction (PGR). This technique may be useful 
to isolate additional protox coding sequences from a desired organism or as a diagnostic 
assay to determine the presence of protox coding sequences in an organism. 

Factors that affect the stability of hybrids determine the stringency of the 
hybridization. One such factor is the melting temperature Tm, which can be easily calculated 
according to the formula provided in DNA PROBES, George H. Keller and Mart( M. Manak , 
IVIacmillan Publishers Ltd, 1993. Section one: Molecular Hybridization Technology; page 8 
ff. 

The preferred hybridization temperature is in the range of about 25**C below the calculated 
melting temperature Tm and preferably in the range of about 12-15*^0 below the calculated 
melting temperature Tm and In the case of oligonucleotides in the range of about 5-1 0^*0 
below the melting temperature T^. 
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Comprised by the present invention are DNA molecules that hybridize to a DMA 
molecule according to the invention as defined hereinbefore, but preferably to an 
oligonucleotide probe obtainable from the DNA molecule comprising a contiguous portion of 
the sequence of the protoporphyrinogen oxidase (protox) enzyme at least 10 nucleotides in 
length, under moderately stringent conditions. 

The invention further embodies the use of a nucleotide probe capable of specifically 
hybridizing to a plant protox gene or mRNA of at least 10 nucleotides length in a 
polymerase chain reaction (PGR). 

In a further embodiment, the present invention provides probes capable of 
specifically hybridizing to a eukaryotic DNA sequence encoding a protoporphyrinogen 
oxidase activity or to the respective mRNA and methods for detecting the DNA sequences 
in eukaryotic organisms using the probes according to the invention. 

Protox specific hybridization probes may also be used to map the location of the 
native eukaryotic protox gene(s) in the genome of a chosen organism using standard 
techniques based on the selective hybridization of the probe to genomic protox sequences. 
These techniques include, but are not limited to, identification of DNA polymorphisms 
identified or contained within the protox probe sequence, and use of such polymorphisms to 
follow segregation of the protox gene relative to other markers of known map position in a 
mapping population derived from self fertilization of a hybrid of two polymorphic parental 
lines (see e.g. Helentjaris ef a/., Plant MoL Biol. 5: 109 (1985). Sommer etal. Biotechniques 
12:82 (1992); D'Ovidio etal.. Plant Mol. Biol. 15: 169 (1990)). While any eukaryotic protox 
sequence is contemplated to be useful as a probe for mapping protox genes from any 
eukaryotic organism, preferred probes are those protox sequences from organisms more 
closely related to the chosen organism, and most preferred probes are those protox 
sequences from the chosen organism. Mapping of protox genes in this manner is 
contemplated to be partlculariy useful in plants for breeding purposes. For instance, by 
knowing the genetic map position of a mutant protox gene that confers heriDicide resistance, 
flanking DNA mart<ers can be identified from a reference genetic map (see, e.g., Helentjaris, 
Trends Genet 3: 217 (1987)). During introgression of the herbicide resistance trait into a 
new breeding line, these markers can then be used to monitor the extent of protox-linked 
flanking chromosomal DNA still present in the recurrent parent after each round of back- 
crossing. 

Protox specific hybridization probes may also be used to quantitate levels of protox 
mRNA in an organism using standard techniques such as Northern blot analysis. This 
technique may be useful as a diagnostic assay to detect altered levels of protox expression 



18 



wo 01/68826 



PCT/EPOl/02753 



that may be associated with particular adverse conditions such as autosomal dominant 
disorder in humans characterized by both neuropsychiatric symptoms and skin lesions, 
which are associated with decreased levels of protox activity (Brenner and Bloomer, New 
Engl. J. Med 302: 765 (1 980)). 

A further embodiment of the invention is a method of producing a DNA molecule 
comprising a DNA portion encoding a protein having protoporphyrinogen oxidase (protox) 
enzyme activity comprising: 

(a) preparing a nucleotide probe capable of specifically hybridizing to a plant 
protox gene or mRNA, wherein the probe comprises a contiguous portion of 
the coding sequence for a protox protein from a plant of at least 10 
nucleotides length; 

(b) probing for other protox coding sequences in populations of cloned genomic 
DNA fragments or cDNA fragments from a chosen organism using the 
nucleotide probe prepared according to step (a); and 

(c) isolating and multiplying a DNA molecule comprising a DNA portion encoding 
a protein having protoporphyrinogen oxidase (protox) enzyme activity. 

A further embodiment of the invention is a method of isolating a DNA molecule from 
any plant comprising a DNA portion encoding a protein having protoporphyrinogen oxidase 
(protox) enzyme activity. 

(a) preparing a nucleotide probe capable of specifically hybridizing to a plant 
protox gene or mRNA, wherein the probe comprises a contiguous portion of 
the coding sequence for a protox protein from a plant of at least 10 
nucleotides length; 

(b) probing for other protox coding sequences in populations of cloned genomic 
DNA fragments or cDNA fragments from a chosen organism using the 
nucleotide probe prepared according to step (a); and 

(c) isolating a DNA molecule comprising a DNA portion encoding a protein 
having protoporphyrinogen oxidase (protox) enzyme activity. 

The Invention further comprises a method of producing an essentially pure DNA 
sequence coding for a protein exhibiting protoporphyrinogen oxidase (protox) enzyme 
activity, which method comprises: 

(a) preparing a genomic or a cDNA library from a suitable source organism using 
an appropriate cloning vector; 

(b) hybridizing the library with a probe molecule; and 
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(c) identifying positive hybridizations of the probe to the DNA clones from the 
library that is clones potentially containing the nucleotide sequence 
corresponding to the amino acid sequence for protoporphyrinogen oxidase 
(protox). 

The invention further comprises a method of producing an essentially pure DNA 
sequence coding for a protein exhibiting protoporphyrinogen oxidase (protox) enzyme 
activity, which method comprises: 

(a) preparing total DNA from a genomic or a cDNA library; 

(b) using the DNA of step (a) as a template for PGR reaction with primers 
representing low degeneracy portions of the amino acid sequence of 
protoporphyrinogen oxidase (protox). 

A further object of the invention is an assay to identify inhibitors of 
protoporphyrinogen oxidase (protox) enzyme activity that comprises: 

(a) incubating a first sample of protoporphyrinogen oxidase (protox) and its 
substrate; 

(b) measuring an uninhibited reactivity of the protoporphyrinogen oxidase 
(protox) from step (a); 

(c) incubating a first sample of protoporphyrinogen oxidase (protox) and its 
substrate in the presence of a second sample comprising an inhibitor 
compound; 

(d) measuring an inhibited reactivity of the protoporphyrinogen oxidase (protox) 
enzyme from step (c); and 

(e) comparing the inhibited reactivity to the uninhibited reactivity of 
protoporphyrinogen oxidase (protox) enzyme. 

A further object of the invention is an assay to identify inhibitor-resistant 
protoporphyrinogen oxidase (protox) mutants that comprises: 

(a) incubating a first sample of protoporphyrinogen oxidase (protox) enzyme and 
its substrate In the presence of a second sample comprising a 
protoporphyrinogen oxidase (protox) enzyme inhibitor; 

(b) measuring an unmutated reactivity of the protoporphyrinogen oxidase 
(protox) enzyme from step (a); 

(c) incubating a first sample of a mutated protoporphyrinogen oxidase (protox) 
enzyme and its substrate in the presence of a second sample comprising 
protoporphyrinogen oxidase (protox) enzyme inhibitor; 
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(d) measuring a mutated reactivity of the mutated protoporphyrinogen oxidase 
(protox) enzyme from step (c); and 

(e) comparing the mutated reactivity to the unmutated reactivity of the 
protoporphyrinogen oxidase (protox) enzyme. 

A further object of the invention is a protox enzyme inhibitor obtained by a method 
according to the invention. 

For recombinant production of the enzyme in a host organism, the protox coding 
sequence may be Inserted into an expression cassette designed for the chosen host and 
introduced into the host where It is recombinantly produced. The choice of specific 
regulatory sequences such as promoter, signal sequence, 5* and 3' untranslated 
sequences, and enhancer, is within the level of skill of the routineer in the art. The resultant 
molecule, containing the individual elements linked in proper reading frame, may be 
inserted into a vector capable of being transfomned into the host cell. Suitable expression 
vectors and methods for recombinant production of proteins are well known for host 
organisms such as £ co//(see, e.g. Studier and Moffatt, J. MoL BioL 189:113 (1986); 
Brosius, DMA 8:759 (1989)), yeast (see, e.g., Schneider and Guarente, Meth. Enzymol. 
194: 373 (1991)) and insect cells (see, e.g., Luckow and Summers, Bio/TechnoL 6: 47 
(1988)). Specific examples include plasmlds such as pBluescript (Stratagene, La Jolla, 
CA), pFLAG (Intemational Biotechnologies. Inc., New Haven, CT), pTrcHis (Invitrogen, La 
Jolla, CA), and baculovims expression vectors, e.g., those derived from the genome of 
Autographica califomica nuclear polyhedrosis virus (AcMNPV). A preferred 
baculovirus/insect system is pVI11392/Sf21 cells (Invitrogen, La Jolla, CA). 

In a preferred embodiment, a coding sequence encoding a mature protox enzyme is 
inserted into an appropriate expression cassette and the mature protox enzyme is 
recombinantly produced in a desired host cell. Alternatively, a coding sequence encoding a 
protox polypeptide precursor comprising an entire plastid transit peptide or a portion of the 
plastid transit peptide Is inserted into an appropriate expression cassette, and the protox 
polypeptide precursor is recombinantly produced In the host cell. The transit peptide is then 
cleaved to release the mature protein. The cleavage is for example carried out by in-vitro 
digestion of the polypeptide precursor, for example using a plastid transit peptide peptidase. 

Recombinantly produced eukaryotic protox enzyme is useful for a variety of 
purposes. For example, it may be used to supply protox enzymatic activity In vitro. It may 
also be used in an in vitro assay to screen known herbicidal chemicals whose target has not 
been identified to determine if they inhibit protox. Such an in vitro assay may also be used 
as a more general screen to identify chemicals that inhibit protox activity and that are 
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therefore herbicide candidates. Recombinantly produced eukaryotic protox enzyme may 
also be used in an assay to identify inhibitor-resistant protox mutants {see International 
application no. PCT/IB95/00452 filed June 8, 1995, published Dec. 21, 1995 as WO 
95/34659, incorporated by reference herein in its entirety). Alternatively, recombinantly 
produced protox enzyme may be used to further characterize its association with known 
inhibitors in order to rationally design new inhibitory herbicides as well as herbicide tolerant 
forms of the enzyme. 

II. Inhibitor Resistant Plant Protox Enzymes 

In another aspect, the present invention teaches modifications that can be made to 
the amino acid sequence of any eukaryotic protoporphyrinogen oxidase (referred to herein 
as '"protox") enzyme to yield an inhibitor-resistant forni of this enzyme. Preferably, the 
eukaryotic protox enzyme is a plant protox enzyme. The present invention is directed to 
inhibitor-resistant protox enzymes having the modifications taught herein, to DNA molecules 
encoding these modified enzymes, and to chimeric genes capable of expressing these 
modified enzymes in plants. 

The present invention is thus directed to an isolated DNA molecule encoding a 
modified eukaryotic protoporphyrinogen oxidase (protox) having at least one amino acid 
modification, wherein the amino acid modification has the property of confemng resistance 
to a protox inhibitor, that is wherein the modified protox is tolerant to an inhibitor in amounts 
that inhibit the naturally occuning eukaryotic protox. As used herein "inhibif refers to a 
reduction in enzymatic activity obsen^ed in the presence of a subject compound compared 
to the level of activity observed in the absence of the subject compound, wherein the 
percent level of reduction is preferably at least 10%, more preferably at least 50%, and 
most preferably at least 90%. 

Preferred is a DNA molecule encoding a modified eukaryotic protoporphyrinogen 
oxidase (protox) that is a plant protox, wherein the modified protox is tolerant to a heriDicide 
in amounts that inhibit the naturally occurring protox activity. Even more preferred is a 
protox selected from the group consisting of an Arabidopsis protox enzyme, a maize protox 
enzyme, a wheat protox enzyme, a soybean protox enzyme, a cotton protox enzyme, a 
sugar beet protox enzyme, an oilseed rape protox enzyme, a rice protox enzyme, a 
sorghum protox enzyme, and a sugar cane protox enzyme having at least one amino acid 
modification, wherein the modified protox is tolerant to a herisicide in amounts that inhibit 
the naturally occurring protox activity. 
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As used herein, the expression "substantially conserved amino acid sequences" 
refers to regions of amino acid homology between polypeptides comprising protox enzymes 
from different sources. In the present invention, seventeen substantially conserved amino 
acid sub-sequences, designated 1-19 respectively, are shown in Table 1B. One skilled in 
the art could align the amino add sequences of protox enzymes from different sources, as 
has been done in Table 1 A, to identify the sub-sequences therein that make up the 
substantially conserved amino acid sequences defined herein. Stated another way, a given 
sub-sequence from one source "corresponds to" a homologous subsequence from a 
different source. The skilled person could then detennine whether the identified sub- 
sequences have the characteristics disclosed and claimed in the present application. 

Therefore, a prefenred embodiment of the present invention is directed to a nucleic 
acid molecule comprising a nucleotide sequence isolated from a plant that encodes an 
enzyme having protoporphyrinogen oxidase (protox) activity, wherein the nucleic acid 
molecule is capable of being incorporated into a nucleic acid construct used to transform a 
plant containing wild-type, herbicide-sensitive protox, wherein the nucleotide sequence has 
at least one point mutation relative to a wild-type nucleotide sequence encoding plant 
protox, such that upon transformation with the nucleic acid construct the plant is rendered 
herbicide-tolerant. 

IVlore particularly, a preferred embodiment of the present invention Is directed to a 
nucleic acid molecule comprising a nucleotide sequence isolated from a plant that encodes 
a modified' enzyme having protoporphyrinogen oxidase (protox) activity, wherein the 
modified enzyme is resistant to an inhibitor of a naturally occurring protox enzyme, wherein 
the modified enzyme comprises at least one of the following amino acid sub-sequences: 



(a) APAiF, wherein Ai is an amino acid other than arginine; 

(b) FA2S, wherein A2 is an amino acid other than cysteine; 

(c) YA3G, wherein A3 is an amino acid other than alanine; 

(d) AA4D, wherein A4 is an amino acid other than glycine; 
(©) YA5P, wherein A5 is an amino acid other than proline; 

(f) PAsA, wherein As is an amino acid other than valine; 

(g) A7IG, wherein A7 is an amino acid other than tyrosine; 

(h) YIGGAa, wherein As Is an amino acid other than alanine or serine; 

(i) AAgP, wherein A9 is an amino acid other than isoleucine; and 
(j) GA10A, wherein A10 is an amino acid other than valine; 

(k) KAAieF, wherein Aia is an amino acid other than alanine; 
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(I) QA19H, wherein A19 is an amino acid other than leucine. 

(Table 1B; sub-sequences 1-10 and 18-19). 

Prefenred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an Inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence APA 
iF, wherein Ai is an amino acid other than arginine. Most preferably, Ai is cysteine or 
leucine. 

Pretended is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence FA 
2S, wherein A2 is an amino acid other than cysteine. Most preferably, A2 is phenylalanine, 
leucine, or lysine. 

Prefen-ed is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence YA 
3G, wherein A3 is an amino acid other than alanine. Most preferably, A3 is valine, threonine, 
leucine, cysteine, or isoleucine. 

Prefen-ed is a nucleic acid molecule comprising a nucleotide sequence Isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occumng 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence AA 
4D, wherein A4 is an amino acid other than glycine. Most preferably, A4 is serine or leucine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence A5 is 
an amino acid other than proline. Most preferably, As is serine or histidine. 

Prefenred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occumng 
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protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence PA 
gA, wherein Ae is an amino acid other than valine. Most preferably, Ae is leucine. 

PrefenBd is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrlnogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence A 
7IG, wherein Ay is an amino acid other than tyrosine. iVIost preferably, A7 is cysteine, 
isoieucine, leucine, threonine, methionine, valine, alanine, histidine, or arginine. 

Prefen'ed is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence FA 
2S, wherein As is an amino acid other than alanine or serine. Most preferably, As is proline. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence AA 
gP, wherein A9 is an amino acid other than isoieucine. Most preferably, A9 is threonine, 
histidine, glycine, or asparagine. 

Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence GA 
10A, wherein A10 is an amino add other than valine. Most preferably, A10 Is alanine. 

Prefen'ed is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme Is resistant to an Inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence KAA 
leF, wherein Aie is an amino acid other than alanine. Most preferably, Aia is threonine or 
valine. 

Prefen'ed is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
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protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence QA 
19H, wherein Aig is an amino acid other than leucine. Most preferably, Aig is serine. 

Another prefered embodiment of the present invention is directed to a nucleic acid 
molecule comprising a nucleotide sequence isolated from a plant that encodes a modified 
enzyme having protoporphyrinogen oxidase (protox) activity, wherein the modified enzyme 
is resistant to an inhibitor of a naturally occurring protox enzyme, wherein the modified 
enzyme comprises at least one of the following amino acid sub-sequences: 



(a) 


APAiF, wherein Ai is an amino acid other than arginine; 


(b) 


FA2S, wherein A2 is an amino acid other than cysteine; 


(c) 


YA3G, wherein A3 is an amino acid other than alanine; 


(d) 


AA4D, wherein A4 is an amino acid other than glycine; 


(e) 


YA5P, wherein A5 is an amino acid other than proline; 


(f) 


PAeA. wherein As is an amino acid other than valine; 


(g) 


A7IG, wherein A7 is an amino acid other than tyrosine; 


(h) 


YIGGAa, wherein As is an amino acid other than alanine or serine; 


(i) 


AA9P, wherein A9 is an amino acid other than isoleucine; and 


(j) 


GA10A, wherein A10 is an amino acid other than valine 


(k) 


KAAisF, wherein Ais is an amino acid other than alanine; 


(1) 


QA19H, wherein A19 is an amino acid other than leucine. 


(Table 1B; sub-sequences 1-10 and 18-19), and wherein the modified enzyme further 


comprises at least one additional amino acid sub-sequence selected from the group 


consisting of: 




(m) 


QA11S, wherein An is an amino acid other than proline; 


(n) 


IGGA12, wherein A12 is an amino acid other than threonine; 


(0) 


SWXLAia, wherein A13 is an amino acid other than serine; 


(P) 


LA14Y, wherein A14 is an amino acid other than asparagine; and 


(q) 


GAisXGL, wherein A15 is an amino acid other than tyrosine. 



Preferred is a nucleic acid molecule comprising a nucleotide sequence isolated from 
a plant that encodes a modified enzyme having protoporphyrinogen oxidase (protox) 
activity, wherein the modified enzyme is resistant to an inhibitor of a naturally occurring 
protox enzyme, wherein the modified enzyme comprises the amino acid sub-sequence YA 
3G, wherein A3 is an amino acid other than alanine, or the amino acid sub-sequence A7IG, 
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wherein A7 is an amino acid other than tyrosine, and wherein the modified enzyme further 
comprises at least one additional amino acid sub-sequence selected from the group 
consisting of: 

(m) QA11S, wherein An is an amino acid other than proline; 
(n) IGGA12, wherein A12 is an amino acid other than threonine; 
(o) SWXLA13, wherein A13 is an amino acid other than serine; 
(p) LA14Y, wherein A^ is an amino acid other than asparagine; and 
(q) GA15XGL, wherein A15 is an amino acid other than tyrosine. 
Preferably, An is leucine, At2 is isoleucine or alanine, A13 is leucine, Au is serine, 
and Ai5 is cysteine. 

Another preferred embodiment of the present invention is directed to a nucleic acid 
molecule comprising a nucleotide sequence isolated from a plant that encodes a modified 
enzyme having protoporphyrinogen oxidase (protox) activity, wherein the modified enzyme 
is resistant to an inhibitor of a naturally occurring protox enzyme, wherein the modified 
enzyme comprises: the amino acid sub-sequence A7IG, wherein Ay is an amino acid other 
than tyrosine; the amino acid sub-sequences IGGA12, wherein A12 is an amino acid other 
than threonine; and the amino acid sub-sequence SWXLA13, wherein A13 is an amino acid 
other than serine. Most preferably, Ay is isoleucine, A12 is isoleucine, and A13 is leucine. 

Yet another preferred embodiment of the present invention is directed to a nucleic 
acid molecule comprising a nucleotide sequence isolated from a plant that encodes a 
modified enzyme having protoporphyrinogen oxidase (protox) activity, wherein the modified 
enzyme is resistant to an inhibitor of a naturally occurring protox enzyme, wherein the 
nucleotide sequence is further characterized in that at least one of the following conditions 
is met: 

(a) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence APAiF, wherein Ai is an amino acid other than arginine; 

(b) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence FA2S. wherein A2 is an amino acid other than cysteine; 

(c) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YA3G, wherein A3 is an amino acid other than alanine; 

(d) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence AA4D, wherein A4 is an amino acid other than glycine; 
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(e) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YA5P, wherein As is an amino acid other than proline; 

(f) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence PAsA, wherein As is an amino acid other than valine; 

(g) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence A7IG, wherein A7 is an amino acid other than tyrosine; 

(h) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YIGGAe, wherein As is an amino acid other than alanine or serine; 

(i) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence AA9P, wherein A9 is an amino acid other than isoleucine; 

(j) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence GAioA, wherein Aio is an amino acid other than valine; 

(k) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence YAsG, wherein A3 is an amino acid other than alanine, and the 
nucleic acid sequence also has a sequence that encodes one of the group 
consisting of: 

(1) sub-sequence QAnS, wherein An is an amino acid other than proline, 

(2) sub-sequence IGQA12, wherein A12 is an amino acid other than threonine, 

(3) sub-sequence SWXLA13, wherein A13 is an amino acid other than serine, 

(4) sub-sequence LA14Y, wherein A14 is an amino acid other than asparagine, 

and 

(5) sub-sequence GA15XGL, wherein A15 is an amino acid other than tyrosine; 
(I) the nucleic acid sequence has a sequence that encodes amino acid sub- 
sequence A7IG, wherein A7 is an amino acid other than tyrosine, and the 
nucleic acid sequence also has a sequence that encodes one of the group 
consisting of: 

(1 ) sub-sequence QAnS, wherein An is an amino acid other than proline, 

(2) sub-sequence IGGA12, wherein A12 Is an amino acid other than threonine, 

(3) sub-sequence SWXLA13, wherein A13 is an amino acid other than serine, 

(4) sub-sequence LA14Y, wherein A14 is an amino acid other than asparagine, 

and 

(5) sub-sequence GA15XGL. wherein A15 is an amino acid other than tyrosine; 
and 
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(m) the nucleic has a sequence that encodes amino acid sub-sequence TAieG, 
wherein Aie is an amino acid other than leucine, and the nucleic acid 
sequence also has a sequence that encodes amino acid sub-sequence YVA 
17G. wherein A17 is an amino acid other than alanine. 
. (n) KAAisF, wherein Ais is an amino acid other than alanine; 

(o) QA19H, wherein A19 is an amino acid other than leucine. 

Preferably, said nucleic acid sequence has a sequence that encodes amino acid 
sub-sequence TAieG, wherein Aie is an amino acid other than leucine, and said nucleic acid 
sequence also has a sequence that encodes amino acid sub-sequence YVA17G, wherein A 
17 is an amino acid other than alanine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the arginine occurring at the position 
corresponding to amino acid 88 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is the DNA molecule wherein the arginine is 
replaced with a cysteine or a leucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the cysteine occurring at the position 
corresponding to amino acid 159 of SEQ ID NO:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that Inhibit the naturally 
occurring protox activity. Particularly preferred is the DNA molecule wherein the cysteine is 
replaced with a phenylalanine or lysine, most prefenred, wherein the cysteine is replaced 
with a phenylalanine. 

Also prefen-ed is a DNA encoding a modified protoporphyrinogen oxidase (protox) 
comprising a plant protox wherein the isoleucine occuning at the position corresponding to 
amino acid 419 of SEQ ID NO:6 is replaced with another amino acid, wherein the modified 
protox is tolerant to a herbicide in amounts that inhibit the naturally occurring protox activity. 
Particulariy prefen-ed is a DNA molecule, wherein the isoleucine is replaced with a 
threonine, histidine, glycine or asparaglne most preferred, wherein the Isoleucine is 
replaced with a threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occurring at the position 
corresponding to amino acid 164 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a heriDlcide in amounts that inhibit the naturally 
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occurring protox activity. Particularly preferred is a DNA molecule wherein the alanine is 
replaced with a threonine, leucine or valine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the glycine occurring at the position 
corresponding to amino acid 165 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the glycine is 
replaced with a serine or leucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the tyrosine occurring at the position 
con-esponding to amino acid 370 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a heri3icide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the tyrosine is 
replaced with a isoleucine or methionine. 

Also prefenred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occurring at the position 
corresponding to amino acid 175 of SEQ ID N0;6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide In amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the alanine is 
replaced with a valine or threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the leucine occurring at the position 
corresponding to amino acid 337 of SEQ ID N0:6 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that Inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the leucine is 
replaced with a serine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the valine occurring at the position corresponding 
to amino acid 356 of SEQ ID N0:10 is replaced with another amino acid, wherein the 
modified protox is tolerant to a hert^icide in amounts that inhibit the naturally occurring 
protox activity. Particulariy preferred is a DNA molecule wherein the valine is replaced with a 
leucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the serine occurring at the position 
corresponding to amino acid 421 of SEQ ID NO:10 is replaced with another amino acid, 
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wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the serine is 
replaced with a proline. 

Also prefered is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the valine occumng at the position corresponding 
to amino acid 502 of SEQ ID NO:10 is replaced with another amino acid, wherein the 
modified protox is tolerant to a herbicide in amounts that inhibit the naturally occurring 
protox activity. Particularly preferred is a DNA molecule wherein the valine is replaced with a 
alanine. 

Also preferred Is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occurring at the position 
con^esponding to amino acid 21 1 of SEQ ID NO:10 Is replaced with another amino acid, 
wherein the modified protox is tolerant to a hertDicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the alanine is 
replaced with a valine or threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the glycine occumng at the position 
corresponding to amino acid 212 of SEQ ID NO:10 is replaced with another amino acid, 
wherein the modified protox is tolerant to a heriDicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the glycine is 
replaced with a serine. 

Also preferred is a DNA encoding a modified protoporphyrinogen oxidase (protox) 
comprising a plant protox wherein the isoleucine occurring at the position corresponding to 
amino acid 466 of SEQ ID N0:10 is replaced with another amino acid, wherein the modified 
protox Is tolerant to a herbicide in amounts that inhibit the naturally occurring protox activity. 
Particulariy preferred is a DNA molecule wherein the isoleucine Is replaced with a threonine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the proline occurring at the position 
corresponding to amino acid 369 of SEQ ID NO:12 is replaced with another amino acid, 
wherein the modified protox is tolerant to a heriDicide in amounts that inhibit the naturally 
occurring protox activity. Particularly preferred is a DNA molecule wherein the proline is 
replaced with a serine or histldlne. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the alanine occurring at the position 
corresponding to amino acid 226 of SEQ ID NO:12 is replaced with another amino acid, 
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wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occumng protox activity. Particularly pretended is a DNA molecule, wherein the alanine is 
replaced with a threonine or leucine. 

Also prefered is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the valine occuning at the position con'esponding 
to amino acid 517 of SEQ ID N0:12 is replaced with another amino acid, wherein the 
modified protox is tolerant to a herbicide in amounts that inhibit the naturally occumng 
protox activity. Particularly preferred is a DNA molecule wherein the valine is replaced with a 
alanine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the tyrosine occurring at the position 
corresponding to amino acid 432 of SEQ ID NO:12 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occumng protox activity. Particularly preferred is a DNA molecule wherein the tyrosine is 
replaced with a leucine or isoleucine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the proline occunring at the position 
con-esponding to amino acid 365 of SEQ ID NO:16 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occumng protox activity. Particulariy preferred is a DNA molecule wherein the proline is 
replaced with a serine. 

Also preferred is a DNA molecule encoding a modified protoporphyrinogen oxidase 
(protox) comprising a plant protox wherein the tyrosine occurring at the position 
corresponding to amino acid 428 of SEQ ID N0:16 is replaced with another amino acid, 
wherein the modified protox is tolerant to a herbicide in amounts that inhibit the naturally 
occurring protox activity. Particulariy preferred is a DNA molecule wherein the tyrosine is 
replaced with a cysteine, histidine or arginine. 

Also preferred is a DNA encoding a modified protoporphyrinogen oxidase (protox) 
comprising a plant protox wherein the tyrosine occumng at the position corresponding to 
amino acid 449 of SEQ ID N0:18 is replaced with another amino acid, wherein the modified 
protox is tolerant to a herbicide in amounts that inhibit the naturally occurring protox activity. 
Particulariy preferred is a DNA molecule wherein the tyrosine is replaced with a cysteine, 
leucine, isoleucine, valine or methionine. 

The present invention is further directed to a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino acid 
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substitution and a second amino acid substitution; the first amino acid substitution having 
the property of confening resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance confenred by the first amino 
acid substitution. Preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox, wherein the plant Is selected from the group 
consisting of maize, wheat, soybean, cotton, sugar beet, oilseed rape, rice, sorghum, sugar 
cane, and Arabidopsis. More preferred is a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox, wherein the plant is 
selected from the group consisting of maize, wheat, soybean, sugar beet, and Arabidopsis, 
Preferred is a DNA molecule wherein the second amino acid substitution occurs at a 
position selected from the group consisting of: 

(a) the position corresponding to the serine at amino acid 305 of SEQ ID N0:2; 

(b) the position con-esponding to the threonine at amino acid 249 of SEQ ID NO:2; 

(c) the position corresponding to the proline at amino acid 1 18 of SEQ ID N0:2; 

(d) the position corresponding to the asparagine at amino acid 425 of SEQ ID N0:2; 

and 

(e) the position corresponding to the tyrosine at amino acid 498 of SEQ ID NO:2. 
Also pref en-ed is a DNA molecule wherein the first amino acid substitution occurs at 

a position selected from the group consisting of: 

(a) the position con-esponding to the arginine at amino acid 88 of SEQ ID NO:6; 

(b) the position con-esponding to the alanine at amino acid 164 of SEQ ID NO:6; 

(c) the position corresponding to the glycine at amino acid 165 of SEQ ID NO:6; 

(d) the position con-esponding to the tyrosine at amino acid 370 of SEQ ID NO:6; 

(e) the position con-esponding to the cysteine at amino acid 159 of SEQ ID N0:6; 
(0 the position conresponding to the Isoleucine at amino acid 419 of SEQ ID NO:6. 

(g) the position con-esponding to the valine at amino acid 356 of SEQ ID NO:10; 

(h) the position con-esponding to the serine at amino acid 421 of SEQ ID NO:10; 

(i) the position corresponding to the valine at amino acid 502 of SEQ ID N0:10; 
G) the position corresponding to the alanine at amino acid 21 1 of SEQ ID NO:10; 
(k) the position corresponding to the glycine at amino acid 212 of SEQ ID NO:10; 
(I) the position con-esponding to the isoleucine at amino acid 466 of SEQ ID NO:10; 
(m) the position corresponding to the proline at amino acid 369 of SEQ ID NO:12; 
(n) the position corresponding to the alanine at amino acid 226 of SEQ ID N0:12; 
(o) the position corresponding to the tyrosine at amino acid 432 of SEQ ID NO:12; 
(p) the position corresponding to the valine at amino acid 517 of SEQ ID NO: 12; 
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c 

(q) the position corresponding to the tyrosine at amino acid 428 of SEQ ID N0:16; 
(r) the position con'esponding to the proline at amino acid 365 of SEQ ID N0:16; 
(s) the position conresponding to the tyrosine at amino acid 449 of SEQ ID N0:18; 
(t) the position con'esponding to the alanine at amino acid 175 of SEQ ID NO:6; 
and 

(u) the position con'esponding to the leucine at amino acid 337 of SEQ ID N0:6. 
Particularly prefen^ed is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox wherein the plant protox comprises an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, 16. 18, 
20, 22, and 37. Most preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox, wherein the plant protox comprises an amino 
acid sequence selected from the group consisting of SEQ ID NOs: 2, 4, 6, 8, 10, 12, and 
18. 

More preferred is a DNA molecule, wherein the first amino acid substitution occurs at 
a position selected from the group consisting of: 

(a) the position con'esponding to the arginine at amino acid 88 of SEQ ID NO:6; 

(b) the position conresponding to the alanine at amino acid 164 of SEQ ID N0:6; 

(c) the position corresponding to the glycine at amino acid 165 of SEQ ID NO:6; 

(d) the position corresponding to the tyrosine at amino acid 370 of SEQ ID N0:6; 

(e) the position conresponding to the cysteine at amino acid 159 of SEQ ID N0:6; 

(f) the position con'esponding to the isoleucine at amino acid 419 of SEQ ID N0:6 

(g) the position con-esponding to the alanine at amino acid 175 of SEQ ID NO:6; 

and 

(h) the position con-esponding to the leucine at amino acid 337 of SEQ ID N0:6. 
More prefen-ed is a DNA molecule wherein the second amino acid substitution 

occurs at the position conresponding to the serine at amino acid 305 of SEQ ID NO;2 and 
the first amino acid substitution occurs at a position selected from the group consisting of: 

(a) the position corresponding to the alanine at amino acid 1 64 of SEQ ID N0:6; 

and 

(b) the position corresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly prefen-ed Is a DNA molecule wherein the serine occurring at the position 

corresponding to amino acid 305 of SEQ ID N0:2 is replaced with leucine. 

More preferred Is a DNA molecule wherein the second amino acid substitution 
occurs at the position corresponding to the threonine at amino acid 249 of SEQ ID N0:2 
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and the first amino acid substitution occurs at a position selected from the group consisting 
of: 

(a) the position con^esponding to the alanine at amino acid 164 of SEQ ID N0:6; 

and 

(b) the position con-esponding to the tyrosine at amino acid 370 of SEQ ID NO:6. 
Particularly prefenred is a DNA wherein the threonine occurring at the position 

con^esponding to amino acid 249 of SEQ ID N0:2 is replaced with an amino acid selected 
from the group consisting of isoleucine and alanine. 

More prefen-ed is a DNA molecule wherein the second amino acid substitution 
occurs at the position con^esponding to the proline at amino acid 1 1 8 of SEQ ID NO:2 and 
the first amino acid substitution occurs at a position selected from the group consisting of: 

(a) the position corresponding to the alanine at amino acid 164 of SEQ ID NO:6; 

and 

(b) the position conresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly prefenred is a DNA molecule wherein the proline occurring at the position 

corresponding to amino acid 1 18 of SEQ ID N0:2 is replaced with a leucine. 

More preferred is a DNA molecule wherein the second amino acid substitution 
occurs at the position corresponding to the asparagine at amino acid 425 of SEQ ID NO:2 
and the first amino acid substitution occurs at a position selected from the group consisting 
of: 

(a) the position corresponding to the alanine at amino acid 1 64 of SEQ ID N0:6; 

and 

(b) the position con-esponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly preferred is a DNA molecule wherein the asparagine occurring at the 

position corresponding to amino acid 425 of SEQ ID N0:2 is replaced with a serine. 

More prefenred is a DNA molecule wherein the second amino acid substitution 
occurs the position corresponding to the tyrosine at amino acid 498 of SEQ ID N0:2 and 
the first amino acid substitution occurs at a position selected from the group consisting of: 

(a) the position corresponding to the alanine at amino acid 164 of SEQ ID NO:6; 

and 

(b) the position conresponding to the tyrosine at amino acid 370 of SEQ ID N0:6. 
Particularly prefeaed is a DNA molecule wherein the tyrosine occurring at the 

position con-esponding to amino acid 498 of SEQ ID N0:2 is replaced with a cysteine. 
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More preferred is a DNA molecule wherein the tyrosine occuning at the position 
corresponding to amino acid 370 of SEQ ID N0:6 is replaced with an amino acid selected 
from the group consisting of cysteine, isoleucine, leucine, threonine, valine and methionine. 

Particularly preferred is a DNA molecule wherein the tyrosine occurring at the 
position corresponding to amino acid 370 of SEQ ID NO:6 is replaced with an amino acid 
selected from the group consisting of cysteine, isoleucine, leucine, threonine and 
methionine. 

More preferred is a DNA molecule wherein the alanine occurring at the position 
corresponding to residue 164 of SEQ ID NO:6 is replaced with an amino acid selected from 
the group consisting of valine, threonine, leucine, cysteine and tyrosine. 

More prefenred is a DNA molecule wherein the glycine occurring at the position 
corresponding to residue 165 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of serine and leucine. 

Particularly preferred is a DNA molecule wherein the glycine occumng at the position 
corresponding to residue 165 of SEQ ID N0:6 is replaced with a serine. 

Particularty preferred Is a DNA molecule wherein the arginine occumng at the 
position con-esponding to residue 88 of SEQ ID NO:6 is replaced with a cysteine or a 
leucine. 

More preferred is a DNA molecule wherein the cysteine occurring at the position 
corresponding to residue 159 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of phenylalanine and lysine. 

Particularly preferred is a DNA molecule wherein the cysteine occurring at the 
position corresponding to residue 159 of SEQ ID N0:6 is replaced with a phenylalanine. 

More preferred is a DNA molecule wherein the isoleucine occumng at the position 
corresponding to residue 419 of SEQ ID N0:6 is replaced with an amino acid selected from 
the group consisting of threonine, histidine, glycine and asparagine. 

Particularly preferred is a DNA molecule wherein the isoleucine occurring at the 
position con^esponding to residue 419 of SEQ ID N0:6 is replaced with a threonine. 

Particularty preferred Is a DNA molecule wherein the alanine occurring at the 
position corresponding to residue 175 of SEQ ID NO:6 is replaced with an amino acid 
selected from the group consisting of threonine and valine. 

Particularly preferred is a DNA molecule wherein the leucine occumng at the 
position con'esponding to residue 337 of SEQ ID N0:6 is replaced with a serine. 

More preferred is a DNA molecule wherein the valine occurring at the position 
corresponding to residue 356 of SEQ ID NO: 10 is replaced with a leucine. 
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More preferred is a DNA molecule wherein the serine occuning at the position 
corresponding to residue 421 of SEQ ID N0:10 is replaced with a proline. 

More preferred is a DNA molecule wherein the valine occuning at the position 
corresponding to residue 502 of SEQ ID NO:10 is replaced with a alanine. 

More preferred is a DNA molecule wherein the isoleucine occumng at the position 
conresponding to residue 466 of SEQ ID N0:10 is replaced with a threonine. 

More prefen^ed is a DNA molecule wherein the glycine occurring at the position 
con'esponding to residue 212 of SEQ ID N0:10 is replaced with a serine. 

More preferred is a DNA molecule wherein the alanine occurring at the position 
conresponding to residue 21 1 of SEQ ID NO:i0 is replaced with a valine or threonine. 

More preferred is a DNA molecule wherein the proline occurring at the position 
corresponding to residue 369 of SEQ ID N0:12 is replaced with a serine or a histidine. 

More preferred is a DNA molecule wherein the alanine occurring at the position 
con'esponding to residue 226 of SEQ ID N0:12 is replaced with a leucine or threonine. 

More prefen-ed is a DNA molecule wherein the tyrosine occurring at the position 
corresponding to residue 432 of SEQ ID N0:12 is replaced with a leucine or isoleucine. 

More preferred is a DNA molecule wherein the valine occuning at the position 
con'esponding to residue 517 of SEQ ID N0:12 is replaced with a alanine. 

More preferred is a DNA molecule wherein the tyrosine occurring at the position 
conresponding to residue 428 of SEQ ID N0:16 is replaced with cysteine, histidine or 
arginine. 

More preferred is a DNA molecule wherein the proline occurring at the position 
corresponding to residue 365 of SEQ ID N0:16 is replaced with serine. 

More preferred is a DNA molecule wherein the proline occurring at the position 
conresponding to residue 449 of SEQ ID N0:18 is replaced with an amino acid selected 
from the group consisting of leucine, isoleucine, valine and methionine. 

The present invention is still further directed to a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox having a double amino acid 
substitution, wherein both amino acid substitutions are required for there to be resistance to 
a protox inhibitor. Preferred is a DNA molecule encoding a modified protoporphyrinogen 
oxidase (protox) comprising a plant protox, wherein the plant is selected from the group 
consisting of maize, wheat, soybean, cotton, sugar beet, oilseed rape, rice, sorghum, sugar 
cane, and Arabidopsls, More prefen^ed is a DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox, wherein the plant is maize. 
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Preferred is a DNA molecule having a double amino acid substitution, wherein one 
amino acid substitution occurs at the position conresponding to the leucine at amino acid 
347 of SEQ ID N0:6, and wherein the second amino acid substitution occurs at the position 
corresponding to the alanine at amino acid 453 of SEQ ID N0:6. 

Particulariy preferred is a DNA molecule having a double amino acid substitution, 
wherein a leucine occumng at the position con-esponding to amino acid 347 of SEQ ID 
N0:6 is replaced with a serine, and wherein an alanine occuning at the position 
corresponding to amino acid 453 of SEQ ID N0:6 is replaced with a threonine. 

The present invention is directed to expression cassettes and recombinant vectors 
comprising the expression cassettes comprising essentially a promoter, but especially a 
promoter that is active in a plant, operatively linked to a DNA molecule encoding the 
protoporphyrinogen oxidase (protox) enzyme from a eukaryotic organism according to the 
invention. The expression cassette according to the invention may In addition further 
comprise a signal sequence operatively linked to the DNA molecule, wherein the signal 
sequence is capable of targeting the protein encoded by the DNA molecule into the 
chloroplast or the mitochondria. 

The invention relates to a chimeric gene, which comprises an expression cassette 
comprising essentially a promoter, but especially a promoter that is active in a plant, 
operatively linked to a heterologous DNA molecule encoding a protoporphyrinogen oxidase 
(protox) enzyme from a eukaryotic organism according to the invention. Preferred is a 
chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) 
enzyme from a plant selected from the group consisting of Arabidopsis, sugar cane, 
soybean, bariey, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, sorghum, rye, 
oats, turf and forage grasses, millet, forage and rice. More preferred is a chimeric gene, 
wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) enzyme from a 
plant selected from the group consisting of soybean, cotton, tobacco, sugar beet, oilseed 
rape, maize, wheat, sorghum, rye, oats, turf grass, and rice. Particulariy prefen^ed is a 
chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) 
enzyme from a plant selected from the group consisting of wheat, soybean, cotton, sugar 
beet, oilseed rape, rice and sorghum. Most preferred is a chimeric gene, wherein the DNA 
molecule encodes an protoporphyrinogen oxidase (protox) enzyme from a plant selected 
from the group consisting of soybean, sugar beet, and wheat. 

More preferred is a chimeric gene comprising a promoter active in a plant operatively 
linked to a heterologous DNA molecule encoding a protoporphyrinogen oxidase (protox) 
selected from the group consisting of a wheat protox comprising the sequence set forth in 
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SEQ ID NO:10. a soybean protox comprising the sequence set forth In SEQ ID NO:12, 
cotton protox comprising the sequence set forth in SEQ ID N0:16, a sugar beet protox 
comprising the sequence set forth in SEQ ID N0:18, an oilseed rape protox comprising the 
sequence set forth in SEQ ID NO:20, a rice protox comprising the sequence set forth in 
SEQ ID NO:22, a sorghum protox comprising the sequence set forth in SEQ ID NO:24, and 
a sugar cane protox comprising the sequence set forth in SEQ ID NO:37. IVIore prefenred is 
a chimeric gene, wherein the protoporphyrinogen oxidase (protox) is selected from the 
group consisting of a wheat protox comprising the sequence set forth in SEQ ID NO:10, a 
soybean protox comprising the sequence set forth in SEQ ID N0:12, and a sugar beet 
protox comprising the sequence set forth in SEQ ID N0:18. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from an Arabidopsis species having protox-1 activity or protox-2 activity, preferably 
wherein the protein comprises the amino acid sequence set forth in SEQ ID NO:2 or SEQ 
ID N0:4. 

Particuiariy preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from maize having protox-1 activity or protox-2 activity, preferably wherein the 
protein comprises the amino acid sequence set forth in set forth in SEQ ID NO:6 or SEQ ID 
N0:8. 

Particularly prefenred is a chimeric gene, wherein the DNA molecule encodes a 
protein from wheat having protox-1 activity preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NOrlO. 

Particularly prefenred is a chimeric gene, wherein the DNA molecule encodes a 
protein from soybean having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID N0:12. 

Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from cotton having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID N0:16. 

Particuiariy prefen-ed is a chimeric gene, where|n the DNA molecule encodes a 
protein from sugar beet having protox-1 activity, preferably wherein the protein comprises 
the amino acid sequence set forth in SEQ ID NO:18. 

Particuiariy preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from oilseed rape having protox-1 activity, preferably wherein the protein comprises 
the amino acid sequence set forth in SEQ ID NO:20. 
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Particularly preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from rice having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NO:22. 

Partlculariy preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from sorghum having protox-1 activity, preferably wherein the protein comprises the 
amino acid sequence set forth in SEQ ID NO:24. 

Particulariy preferred is a chimeric gene, wherein the DNA molecule encodes a 
protein from sugar cane having protox-1 activity, preferably wherein the protein comprises 
the amino acid sequence set forth in SEQ ID NO:37. 

The invention also embodies a chimeric gene, which comprises an expression 
cassette comprising essentially a promoter, but especially a promoter that is active in a 
plant, operatively linked to the DNA molecule encoding an protoporphyrinogen oxidase 
(protox) enzyme from a eukaryotic organism according to the invention, which is resistant to 
heriDicides at levels that inhibit the conresponding unmodified version of the enzyme. 
Preferred is a chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen 
oxidase (protox) enzyme from a plant selected from the group consisting of Arabidopsis, 
sugar cane, soybean, bartey, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, 
sorghum, rye, oats, turf and forage grasses, millet, forage and rice. More preferred is a 
chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen oxidase (protox) 
enzyme from a plant selected from the group consisting of soybean, cotton, tobacco, sugar 
beet, oilseed rape, maize, wheat, sorghum, rye, oats, turf grass, and rice. Particularly 
preferred is a chimeric gene, wherein the DNA molecule encodes an protoporphyrinogen 
oxidase (protox) enzyme from a plant selected from the group consisting of Arabidopsis, 
soybean, cotton, sugar beet, oilseed rape, maize, wheat, sorghum, and rice. 

Encompassed by the present invention is a chimeric gene comprising a promoter 
that is active in a plant operatively linked to the DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a eukaryotic protox having at least one 
amino acid modification, wherein the amino acid modification has the property of conferring 
resistance to a protox inhibitor. 

Also encompassed by the present invention is a chimeric gene comprising a 
promoter that is active in a plant operatively linked to the DNA molecule encoding a 
modified protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino 
acid substitution and a second amino acid substitution; the first amino acid substitution 
having the property of confening resistance to a protox inhibiton and the second amino acid 
substitution having the property of enhancing the resistance conferred by the first amino 
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acid substitution. Preferred is the chimeric gene additionally comprising a signal sequence 
operatively linked to the DNA molecule, wherein the signal sequence is capable of targeting 
the protein encoded by the DNA molecule into the chloroplast or in the mitochondria. 

The chimeric gene according to the invention may in addition further comprise a 
signal sequence operatively linked to the DNA molecule, wherein the signal sequence is 
capable of targeting the protein encoded by the DNA molecule into the chloroplast. The 
chimeric gene according to the invention may in addition further comprise a signal 
sequence operatively linked to the DNA molecule, wherein the signal sequence is capable 
of targeting the protein encoded by the DNA molecule into the mitochondria. 

Also encompassed by the present invention is any of the DNA sequences 
mentioned herein before, which is stably integrated into a host genome. 

The invention further relates to a recombinant DNA molecule comprising a plant 
protoporphyrinogen oxidase (protox) or a functionally equivalent derivative thereof. 

The invention further relates to a recombinant DNA vector comprising the 
recombinant DNA molecule of the invention. 

A further object of the invention is a recombinant vector comprising the chimeric 
gene according to the invention, wherein the vector is capable of being stably transfonned 
into a host cell. 

A further object of the invention is a recombinant vector comprising the chimeric 
gene according to the invention, wherein the vector is capable of being stably transfonned 
into a plant, plant seeds, plant tissue or plant cell. Prefen-ed is a recombinant vector 
comprising the chimeric gene according to the invention, wherein the vector is capable of 
being stably transfonned Into a plant The plant, plant seeds, plant tissue or plant cell stably 
transfonmed with the vector is capable of expressing the DNA molecule encoding a 
protoporphyrinogen oxidase (protox). Preferred Is a recombinant vector, wherein the plant, 
plant seeds, plant tissue or plant cell stably transfonmed with the the vector is capable of 
expressing the DNA molecule encoding a protoporphyrinogen oxidase (protox) from a plant 
that is resistant to hert^icides at levels that inhibit the corresponding unmodified version of 
the enzyme. 

Preferred is a recombinant vector comprising the chimeric gene comprising a 
promoter active in a plant operatively linked to a heterologous DNA molecule encoding a 
protoporphyrinogen oxidase (protox) selected from the group consisting of a wheat protox 
comprising the sequence set forth tn SEQ ID NO:10, a soybean protox comprising the 
sequence set forth in SEQ ID N0:12, cotton protox comprising the sequence set forth in 
SEQ ID N0:1 6, a sugar beet protox comprising the sequence set forth In SEQ ID N0:18. an 
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Oilseed rape protox comprising the sequence set forth in SEQ ID N0:20, a rice protox 
comprising the sequence set forth in SEQ ID NO:22, a sorghum protox comprising the 
sequence set forth in SEQ ID NO:24, and a sugar cane protox comprising the sequence set 
forth in SEQ ID NO:37, wherein the vector is capable of being stably transformed into a host 
cell. 

Also pretended is recombinant vector comprising the chimeric gene comprising a 
promoter that is active in a plant operatively linked to the DNA molecule encoding a 
modified protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino 
acid substitution and a second amino acid substitution; the first amino acid substitution 
having the property of confening resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance conferred by the first amino 
acid substitution, wherein the vector is capable of being stably transformed into a plant cell. 

Also encompassed by the present invention is a host cell stably transformed with the 
vector according to the invention, wherein the host cell is capable of expressing the DNA 
molecule. Preferred is a host cell selected from the group consisting of a plant cell, a 
bacterial cell, a yeast cell, and an Insect cell. 

The present invention is further directed to plants and the progeny thereof, plant 
tissue and plant seeds tolerant to hert)lcides that inhibit the naturally occurring protox 
activity in these plants, wherein the tolerance is conferred by a gene expressing a modified 
inhibitor-resistant protox enzyme as taught herein. Representative plants include any plants 
to which these hert)icides may be applied for their nonmally Intended purpose. Preferred are 
agronomically important crops, I.e., angiosperms and gymnosperms such as Arabidopsis, 
sugar cane, soybean, bariey, cotton, tobacco, sugar beet, oilseed rape, maize, wheat, 
sorghum, rye, oats, tomato, potato, turf and forage grasses, millet, forage, and rice and the 
like. More preferred are agronomically important crops, i.e., angiospemis and gymnosperms 
such as Arabidopsis, cotton, soybean, oilseed rape, sugar beet, maize, rice, wheat, bariey, 
oats, rye, sorghum, millet, turf, forage, turf grasses. Partlculariy prefen-ed are agronomically 
important crops, i.e., angiosperms and gymnosperms such as Arabidopsis, soybean, cotton, 
sugar beet, oilseed rape, maize, wheat, sorghum, and rice. 

Preferred is a plant comprising the DNA molecule encoding a modified 
protoporphyrinogen oxidase (protox) comprising a plant protox having a first amino acid 
substitution and a second amino acid substitution; the first amino acid substitution having 
the property of conferring resistance to a protox inhibitor; and the second amino acid 
substitution having the property of enhancing the resistance conf en-ed by the first amino 
acid substitution, wherein the DNA molecule is expressed in the plant and confers upon the 
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plant tolerance to a herbicide in amounts that inhibit naturally occurring protox activity. 
Preferred is a plant, wherein the DMA molecule replaces a corresponding naturally 
occurring protox coding sequence. Comprised by the present invention is a plant and the 
progeny thereof comprising the chimeric gene according to the invention, wherein the 
chimeric gene confers upon the plant tolerance to a heri3icide In amounts that inhibit 
naturally occurring protox activity. 

Encompassed by the present invention are transgenic plant tissue, including plants 
and the progeny thereof, seeds, and cultured tissue, stably transfomied with at least one 
chimeric gene according to the invention. Preferred is transgenic plant tissue. Including 
plants, seeds, and cultured tissue, stably transfonned with at least one chimeric gene that 
comprises an expression cassette comprising essentially a promoter, but especially a 
promoter that is active in a plant, operatively linl^ed to the DNA molecule encoding an 
protoporphyrinogen oxidase (protox) enzyme that is resistant to herbicides at levels that 
inhibit the conresponding unmodified version of the enzyme in the plant tissue. 

The present invention is further directed to plants, plant tissue, plant seeds, and 
plant cells tolerant to heriDicides that inhibit the naturally occurring protox activity in these 
plants, wherein the tolerance is conferred by increasing expression of wild-type 
herbicide-sensitive protox. This results in a level of a protox enzyme in the plant cell at least 
sufficient to overcome growth inhibition caused by the herbicide. The level of expressed 
enzyme generally is at least two times, preferably at least five times, and more preferably at 
least ten times the natively expressed amount. Increased expression may be due to 
multiple copies of a wild-type protox gene; multiple occurrences of the coding sequence 
within the gene {i.e. gene amplification) or a mutation In the non-coding, regulatory 
sequence of the endogenous gene in the plant cell. Plants having such altered gene 
activity can be obtained by direct selection in plants by methods known in the art (see, e.g. 
U.S. Patent No. 5,162,602, and U.S. Patent No. 4,761,373, and references cited therein). 
These plants also may be obtained by genetic engineering techniques known in the art. 
Increased expression of a heriDicide-sensitive protox gene can also be accomplished by 
stably transforming a plant cell with a recombinant or chimeric DNA molecule comprising a 
promoter capable of driving expression of an associated stmctural gene in a plant cell 
operatively linked to a homologous or heterologous structural gene encoding the protox 
enzyme. 

The recombinant DNA molecules of the invention can be introduced into the plant 
cell in a number of art-recognized ways. Those skilled in the art will appreciate that the 
choice of method might depend on the type of plant. I.e. monocot or dicot, targeted for 
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transformation. Suitable metliods of transfonming plant cells include microinjection 
(Crossway etai, BioTechniques 4.<320'334 (1986)). electroporation (Riggs ef a/, Proc. NatL 
Acad Scl. USA 83:5602-5606 (1986). Agrobacterium mediated transfomiation (Hinchee et 
al., Biotechnology 6:915-921 (1988)), direct gene transfer (Paszkowski ef a/., EMBO J. 
3.2717-2722 (1984)). ballistic particle acceleration using devices available from Agracetus, 
Inc., Madison, Wisconsin and Dupont, Inc., Wilmington, Delaware (see, for example, 
Sanford ef a/., U.S. Patent 4,945,050; and McCabe ef a/.. Biotechnology 6^23-928 (1988)), 
protoplast transfomiation/regeneratlon methods (see U.S. Patent No. 5.350,689 Issued 
Sept. 27, 1994 to Clba-Qeigy Corp.), and pollen transformation {see U.S. Patent No. 
5,629,183). Also see, Weissinger ef a/., Annual Rev. Genet 22:421-477 (1988); Sanford ef 
a/., Particulate Science and Technology 5.27-37 (1987)(onion); Christou ef a/., Plant 
Physiol. 87:671-674 (1988)(soybean); McCabe etal., Bio/Technology 6:923-926 
(1988)(soybean); Datta ef a/., Bio/Technology a.736-740 (1990)(rice); Klein ef a/., Proc. 
Natl. Acad ScL USA, 85:4305-4309 (1 988)(malze); Klein ef a/., B/o/Tec/ino/ogy 6:559-563 
(1988)(maize); Klein etal., Plant Physiol. 9y:440-444 (1988)(maize); Fromm ef a/., 
Bio/Technology 8 A33'339 (1990); Gordon-Kamm ef aA, Plant Cell 2,603-6^3 (1990) 
(maize); and U.S. Patent Nos. 5,591,616 and 5,679.558 (rice). 

Comprised within the scope of the present invention are transgenic plants, in 
particular transgenic fertile plants transformed by means of the aforedescribed processes 
and their asexual and/or sexual progeny, which still are resistant or at least tolerant to 
inhibition by a herbicide at levels that nomnaily are inhibitory to the naturally occumng 
protox activity In the plant. Progeny plants also Include plants with a different genetic 
background than the parent plant, which plants result from a backcrossing program and still 
comprise In their genome the herbicide resistance trait according to the invention. Very 
especially prefenred are hybrid plants that are resistant or at least tolerant to inhibition by a 
herbicide at levels that normally are inhibitory to the naturally occurring protox activity in the 
plant. 

The transgenic plant according to the invention may be a dicotyledonous or a 
monocotyledonous plant. Prefenred are monocotyledonous plants of the Graminaceae 
family involving Lolium, Zea, Tritlcum, Triticale, Sorghum, Sacchamm, Bromus, Oryzae, 
Avena, Hordeum, Secale and Setaria plants. More preferred are transgenic maize, wheat, 
bariey. sorghum, rye, oats, sugar cane, turf and forage grasses, millet and rice. Especially 
preferred are maize, wheat, sorghum, rye, oats, turf grasses and rice. 
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Among the dicotyledonous plants Arabidopsis, soybean, cotton, sugar beet, oilseed 
rape, tobacco, tomato, potato, and sunflower are more preferred herein. Especially 
preferred are soybean, cotton, tobacco, sugar beet, tomato, potato, and oilseed rape. 

The expression 'progeny' is understood to embrace both, "asexually" and "sexually" 
generated progeny of transgenic plants. This definition is also meant to include all mutants 
and variants obtainable by means of known processes, such as for example cell fusion or 
mutant selection and that still exhibit the characteristic properties of the initial transformed 
plant, together with all crossing and fusion products of the transfomned plant material. This 
also includes progeny plants that result from a backcrossing program, as long as the 
progeny plants still contain the herisicide resistant trait according to the invention. 

Another object of the invention concerns the proliferation material of transgenic 
plants. The proliferation material of transgenic plants is defined relative to the invention as 
any plant material that may be propagated sexually or asexually in vivo or In vitro. 
Particularly prefered within the scope of the present invention are protoplasts, cells, calli, 
tissues, organs, seeds, embryos, pollen, egg cells, zygotes, together with any other 
propagating material obtained from transgenic plants. 

Parts of plants, such as for example flowers, stems, fruits, leaves, roots originating 
In transgenic plants or their progeny previously transformed by means of the process of the 
invention and therefore consisting at least In part of transgenic cells, are also an object of 
the present invention. 

A further object of the invention Is a method of producing plants, protoplasts, cells, 
calli, tissues, organs, seeds, embryos, pollen, egg cells, zygotes, together with any other 
propagating material, parts of plants, such as for example flowers, stems, fruits, leaves, 
roots originating in transgenic plants or their progeny previously transfomied by means of 
the process of the invention, which therefore produce an inhibitor resistant fomri of a plant 
protox enzyme by transforming the plant, plant parts with the DNA according to the 
invention. Prefen-ed is a method of producing a host cell comprising an isolated DNA 
molecule encoding a protein from a eukaryote having protoporphyrinogen oxidase (protox) 
activity comprising transfonming the host cell with a recombinant vector molecule according 
to the invention. Further pref en'ed is a method of producing a plant cell comprising an 
isolated DNA molecule encoding a protein from a eukaryote having protoporphyrinogen 
oxidase (protox) activity comprising transfonning the plant cell with a recombinant vector 
molecule according to the invention. Preferred is a method of producing transgenic progeny 
of a transgenic parent plant comprising an isolated DNA molecule encoding a protein from a 
eukaryote having protoporphyrinogen oxidase (protox) activity comprising transforming the 
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parent plant with a recombinant vector molecule according to the invention and transfening 
the herbicide tolerant trait to the progeny of the transgenic parent plant involving known 
plant breeding techniques. 

Prefen-ed is a method for the production of plants, plant tissues, plant seeds and 
plant parts, which produce an inhibitor-resistant fonn of the plant protox enzyme, wherein 
the plants, plant tissues, plant seeds and plant parts have been stably transformed with a 
structural gene encoding the resistant protox enzyme. Particularly preferred is a method for 
the production of plants, plant tissues, plant seeds and plant parts, wherein the plants, plant 
tissues, plant seeds and plant parts have been stably transfonned with the DNA according 
to the invention. Especially prefen-ed is a method for the production of the plants, plant 
tissues, plant seeds and plant parts, which produce an inhibitor-resistant fonn of the plant 
protox enzyme, wherein the plants, plant tissues, plant seeds and plant parts have been 
prepared by direct selection techniques whereby herbicide resistant lines are isolated, 
characterized and developed. 

The genetic properties engineered into the transgenic seeds and plants described 
above are passed on by sexual reproduction or vegetative growth and can thus be 
maintained and propagated In progeny plants. Generally the maintenance and propagation 
make use of known agricultural methods developed to fit specific purposes such as tilling, 
sowing or harvesting. Specialized processes such as hydroponics or greenhouse 
technologies can also be applied. As the growing crop is vulnerable to attack and damages 
caused by insects or infections as well as to competition by weed plants, measures are 
undertaken to control weeds, plant diseases, insects, nematodes, and other adverse 
conditions to improve yield. These include mechanical measures such a tillage of the soil or 
removal of weeds and infected plants, as well as the application of agrochemicals such as 
herbicides, fungicides, gametocides, nematicides, growth regulants, ripening agents and 
insecticides. 

Use of the advantageous genetic properties of the transgenic plants and seeds 
according to the invention can further be made in plant breeding that aims at the 
development of plants with improved properties such as tolerance of pests, heriDicide 
tolerance, or stress tolerance, improved nutritional value, increased yield, or improved 
structure causing less loss from lodging or shattering. The various breeding steps are 
characterized by well-defined human intervention such as selecting the lines to be crossed, 
directing pollination of the parental lines, or selecting appropriate progeny plants. 
Depending on the desired properties different breeding measures are taken. The relevant 
techniques are well known in the art and include but are not limited to hybridization, 
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inbreeding, backcross breeding, multiline breeding, variety blend, interspecific hybridization, 
aneuploid tecliniques, etc. Hybridization teciiniques also include the sterilization of plants to 
yield male or female sterile plants by mechanical, chemical or biochemical means. Cross 
pollination of a male sterile plant with pollen of a different line assures that the genome of 
the male sterile but female fertile plant will uniformly obtain properties of both parental lines. 
Thus, the transgenic seeds and plants according to the invention can be used for the 
breeding of improved plant lines that for example increase the effectiveness of conventional 
methods such as herbicide or pesticide treatment or allow to dispense with the methods due 
to their modified genetic properties. Alternatively new crops with Improved stress tolerance 
can be obtained that, due to their optimized genetic "equipmenf , yield han/ested product of 
better quality than products that were not able to tolerate comparable adverse 
developmental conditions. 

In seeds production gennination quality and unifonmity of seeds are essential 
product characteristics, whereas gennlnation quality and uniformity of seeds han/ested and 
sold by the fanner is not important. As it is difficult to keep a crop free from other crop and 
weed seeds, to control seedbome diseases, and to produce seed with good germination, 
fairiy extensive and well-defined seed production practices have been developed by seed 
producers, who are experienced in the art of growing, conditioning and marketing of pure 
seed. Thus, it is common practice for the farmer to buy certified seed meeting specific 
quality standards instead of using seed han/ested from his own crop. Propagation material 
to be used as seeds is customarily treated with a protectant coating comprising hert>icides, 
insecticides, fungicides, bactericides, nematicides, molluscicides or mixtures thereof. 
Customarily used protectant coatings comprise compounds such as captan, cart>oxin, 
thiram (TMTD®), methalaxyl (Apron®), and pirimiphos-methyl (Actellic®). If desired these 
compounds are fomnulated together with further carriers, surfactants or application- 
promoting adjuvants customarily employed in the art of formulation to provide protection 
against damage caused by bacterial, fungal or animal pests. The protectant coatings may 
be applied by impregnating propagation material with a liquid fomnulation or by coating with 
a combined wet or dry fomiulation. Other methods of application are also possible such as 
treatment directed at the buds or the f mit. 

It is thus a further object of the present invention to provide plant propagation 
material for cultivated plants, but especially plant seed that is treated with an seed 
protectant coating customarily used in seed treatment. 

It is a further aspect of the present invention to provide new agricultural methods 
such as the methods exemplified above, which are characterized by the use of transgenic 
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plants, transgenic plant material, or transgenic seed according to the present invention. 
Comprised by the present invention is an agricultural method, wherein a transgenic plant or 
the progeny thereof is used comprising a chimeric gene according to the invention in an 
amount sufficient to express herbicide resistant forms of heriDicide target proteins in a plant 
to confer tolerance to the herbicide. 

To breed progeny from plants transformed according to the method of the present 
invention, a method such as that which follows may be used: maize plants produced as 
described in the examples set forth below are grown in pots in a greenhouse or in soil, as is 
known in the art, and permitted to flower. Pollen is obtained from the mature tassel and 
used to pollinate the ears of the same plant, sibling plants, or any desirable maize plant. 
Similarly, the ear developing on the transformed plant may be pollinated by pollen obtained 
from the same plant, sibling plants, or any desirable maize plant. Transformed progeny 
obtained by this method may be distinguished from non-transfonned progeny by the 
presence of the introduced gene(s) and/or accompanying DNA (genotype), or the 
phenotype conf en-ed. The transformed progeny may similariy be selfed or crossed to other 
plants, as is nonnally done with any plant carrying a desirable trait. Similarly, tobacco or 
other transformed plants produced by this method may be selfed or crossed as is known in 
the art in order to produce progeny with desired characteristics. Similarly, other transgenic 
organisms produced by a combination of the methods known in the art and this invention 
may be bred as is known in the art in order to produce progeny with desired characteristics. 

The modified inhibitor-resistant protox enzymes of the invention have at least one 
amino acid substitution, addition or deletion relative to their naturally occuning counterpart 
(I.e. inhibitor-sensitive fomris that occur naturally In a plant without being manipulated, either 
directly via recombinant DNA methodology or indirectly via selective breeding, etc., by man). 
Amino acid positions that may be modified to yield an inhibitor-resistant fomi of the protox 
enzyme, or enhance inhibitor resistance, are indicated in bold type in Table 1A in the 
context of plant protox-1 sequences from Arabidopsis, maize, soybean, cotton, sugar beet, 
oilseed rape, rice, sorghum, wheat, and sugarcane. The skilled artisan will appreciate that 
equivalent changes may be made to any plant protox gene having a stmcture sufficiently 
similar to the protox enzyme sequences shown herein to allow alignment and identification 
of those amino acids that are modified according to the invention to generate inhibitor- 
resistant forms of the enzyme. Such additional plant protox genes may be obtained using 
standard techniques as described in Intemational application no. PCT/IB95/00452 filed 
June 8, 1995, published Dec. 21, 1995 as WO 95/34659 whose relevant parts are herein 
incorporated by reference. 



48 



wo 01/68826 



PCT/EPOl/02753 



DNA molecules encoding the herbicide resistant protox coding sequences taught 
herein may be genetically engineered for optimal expression in a crop plant. This may 
include altering the coding sequence of the resistance allele for optimal expression in the 
crop species of Interest. Methods for modifying coding sequences to achieve optimal 
expression in a particular crop species are well known (see, e.g. Perlak ef a/., Proa Natl. 
Acad Sci. USA 88:3324 (1991); Koziel ef a/., Bio/technoL Jt; 194 (1993)). 

Genetically engineering a protox coding sequence for optimal expression may also 
include operatively linking the appropriate regulatory sequences (i.e. promoter, signal 
sequence, transcriptional temninators). Examples of promoters capable of functioning in 
plants or plant cells (i.e., those capable of driving expression of the associated structural 
genes such as protox in plant cells) Include the cauliflower mosaic virus (CaMV) 1 9S or 35S 
promoters and CaMV double promoters; nopaline synthase promoters; 
pathogenesis-related (PR) protein promoters; small subunit of ribulose bisphosphate 
cari^oxylase (ssuRUBISCO) promoters, heat shock protein promoter from Brassica with 
reference to EPA 0 559 603 (hspBO promoter), Arabidopsis actin promoter and the 
SuperMas promoter with reference to WO 95/14098 and the like. Preferred promoters will 
be those that confer high level constitutive expression or, more preferably, those that confer 
specific high level expression In the tissues susceptible to damage by the hert)icide. 
Prefen'ed promoters are the rice actin promoter (McElroy etal., Mol. Gen, Genet 231: 150 
(1 991 )), maize ubiquitin promoter (EP 0 342 926; Taylor et aL, Plant Cell Rep. 12: 491 
(1993)), and the PR-1 promoter from tobacco. Arabidopsis, or maize (see U.S. Patent No. 
5,614,395 to Ryals ef a/., incorporated by reference herein in its entirety). The promoters 
themselves may be modified to manipulate promoter strength to increase protox 
expression, in accordance with art-recognized procedures. 

The inventors have also discovered that another preferred promoter for use with the 
inhibitor-resistant protox coding sequences Is the promoter associated with the native 
protox gene (i.e. the protox promoter; see copending, co-owned U.S. Patent Application No. 
08/808,323, entitled "Promoters from Protoporphyrinogen Oxidase Genes", incorporated by 
reference herein in its entirety.) The promoter sequence from an Arabidopsis protox-1 gene 
is set forth in SEQ ID N0:1 3, the promoter sequence from a maize protox-1 gene is set 
forth in SEQ ID N0:14, and the promoter sequence from a sugar beet protox-1 gene Is set 
forth in SEQ ID N0:26. 

Since the protox promoter itself is suitable for expression of inhibitor-resistant protox 
coding sequences, the modifications taught herein may be made directly on the native 
protox gene present in the plant cell genome without the need to construct a chimeric gene 
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with heterologous regulatory sequences. Such modifications can be made via directed 
mutagenesis techniques such as homologous recombination and selected for based on the 
resulting herbicide-resistance phenotype (see, e.g. Example 10, Pazkowski etal., EMBOJ, 
7:4021-4026 (1988). and U.S. Patent No. 5,487,992, particularly columns 18-19 and 
Example 8). An added advantage of this approach is that besides containing the native 
protox promoter, the resulting modified gene will also include any other regulatory elements, 
such as signal or transit peptide coding sequences, which are part of the native gene. 

In the event of transfomnation of the nuclear genome, signal or transit peptides may 
be fused to the protox coding sequence in chimeric DNA constructs of the invention to 
direct transport of the expressed protox enzyme to the desired site of action. Examples of 
signal peptides include those natively linked to the plant pathogenesis-related proteins, e.g. 
PR-1 , PR-2, and the like. See, e.g., Payne et aL, Plant MoL Biol. 1 y:89-94 (1 988). 
Examples of transit peptides include the chloroplast transit peptides such as those 
described in Von Heljne eta!., Plant MoL Biol. Rep. 9.104-126 (1991); Mazur etal., Plant 
Physiol. 85: 1 1 1 0 (1 987); Vorst ef al., Gene 65: 59 (1 988), and mitochondrial transit 
peptides such as those described in Boutry etal., Nature 328.^40-342 (1987). Chloroplast 
and mitochondrial transit peptides are contemplated to be particularly useful with the 
present invention as protox enzymatic activity typically occurs within the mitochondria and 
chloroplast. Most preferred for use are chloroplast transit peptides, as inhibition of the 
protox enzymatic activity in the chloroplasts is contemplated to be the primary basis for the 
action of protox-lnhibiting hert^icides (WItkowski and Hailing, Plant Physiol. 87: 632 (1988); 
Lehnen etal., Pestle. Biochem. Physiol. 37: 239 (1990); Duke etal., WeedSci. 39: 465 
(1991)). Also included are sequences that result in localization of the encoded protein to 
various cellular compartments such as the vacuole. See, for example, Neuhaus etal., Proc. 
Natl. Acad. Sci. USA 88: 10362-10366 (1991) and Chrispeels, Ann. Rev. Plant Physiol. 
Plant Mol. Biol. 42. 21-53 (1991). The relevant disclosures of these publications are 
incorporated herein by reference In their entirety. 

Chimeric genes of the invention may contain multiple copies of a promoter or 
multiple copies of the protox structural genes. In addition, the constmct(s) may include 
coding sequences for mariners and coding sequences for other peptides such as signal or 
transit peptides, each in proper reading frame with the other functional elements in the DNA 
molecule. The preparation of such constmcts are within the ordinary level of skill in the art. 

Useful markers include peptides providing herbicide, antibiotic or drug resistance, 
such as, for example, resistance to hygromycin. kanamycin, G418, gentamycin, lincomycin, 
methotrexate, glyphosate, phosphinothricin, or the like. These markers can be used to 
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select cells transformed with the chimeric DNA constructs of the invention from 
untransformed cells. Other useful markers are peptidic enzymes that can be easily 
detected by a visible reaction, for example a color reaction, for example luciferase, 
B-glucuronidase, or Q-galactosidase. 

The method of positive selection of genetically transfomied cells into which a 
desired nucleotide sequence can be incorporated by providing the transfomned cells with a 
selective advantage is herein incorporated by reference as WO 94/20627. 

Where a herbicide resistant protox allele Is obtained via directed mutation of the 
native gene in a crop plant or plant ceil culture from which a crop plant can be regenerated, 
it may be moved into commercial varieties using traditional breeding techniques to develop 
a herbicide tolerant crop without the need for genetically engineering the modified coding 
sequence and transforming it into the plant. Alternatively, the herbicide resistant gene may 
be isolated, genetically engineered for optimal expression and then transformed into the 
desired variety. 

Genes encoding altered protox resistant to a protox inhibitor can also be used as 
selectable markers in plant cell transformation methods. For example, plants, plant tissue 
or plant cells transformed with a transgene can also be transformed with a gene encoding 
an altered protox capable of being expressed by the plant. The thus-transformed cells are 
transferred to medium containing the protox Inhibitor wherein only the transf onmed cells will 
survive. Protox inhibitors contemplated to be particularly useful as selective agents are the 
diphenylethers {e.g. acifluorfen, 5-[2-chloro-4-(trifluoromethyl)phenoxy]-2-nitrobe2oic acid; 
its methyl ester; or oxyfluorfen, 2-chloro-1-(3-ethoxy-4-nitrophenoxy)-4-(trifluorobenzene)}, 
oxidiazoles, (e.g. oxidiazon, 3-[2,4-dichloro-5-(1-methylethoxy)phenyl]-5-(1,1"dimethylethyl)- 
1,3,4-oxadiazol-2-(3A/)-one), cyclic imides (e.g. S-23142, AA(4-chloro-2-fluoro-5- 
propargyioxyphenyl)-3.4,5,6-tetrahydrophthalimide; chlorophthalim. A/^(4-chlorophenyl)- 
3,4,5,6-tetrahydrophthalimide). phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1 -(2,3,4- 
trichlorophenyl)-4-nitropyrazolyl-5-oxy]propionate; M&B 39279). pyridine derivatives, such 
as e.g. pyridyl amides (e.g. LS 82-556), and phenopylate and its O-phenyipynrolidino- and 
piperidinocarbamate analogs and triazolones, such as bicyclic triazolones as disclosed in 
the International patent application WO 92/04827; EP 532146). 

The method is applicable to any plant cell capable of being transformed with an 
altered protox-encoding gene, and can be used with any transgene of interest. Expression 
of the transgene and the protox gene can be driven by the same promoter functional on 
plant cells, or by separate promoters. 
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Modified inhibitor-resistant protox enzymes of the present invention are resistant to 
herbicides that inhibit the naturally occurring protox activity. The herbicides that inhibit 
protox include many different structural classes of molecules (Duke et aL, Weed ScL 39: 
465 (1991); Nandihalli ef a/., Pesticide Biochem. Physiol. 43; 193 (1992); Matringe ef a/., 
FEBS Lett. 245: 35 (1989); Yanase and Andoh, Pesticide Biocliem. Physiol. 35: 70 (1989)), 
including the diphenylethers {e.g. acifluorifen, 5-[2-chloro-4-(trifluoromethyl)phenoxy]-2- 
nitrobezoic acid; its methyl ester; or oxyfluorfen, 2-ch!oro-1-(3-ethoxy-4-nitrophenoxy)-4- 
(trifluorobenzene)}, oxidiazoles (e.g. oxidiazon, 3-[2,4-dichloro-5-(1-methylethoxy)phenyl]-5- 
(1.1-dimethylethyl)-1.3.4-oxadia2ol-2-(3H)-one), cyclic imides (e.g. S-23142, A/^(4-chloro-2- 
fluoro-5-propargyloxyphenyl)-3,4.5.6-tetrahydrophthalimide; chlorophthaiim, A/-(4- 
chlorophenyl)-3,4,5.6-tetrahydrophthalimide). phenyl pyrazoles (e.g. TNPP-ethyl, ethyl 2-[1- 
(2,3,4-trichlorophenyl)-4-nitropyrazolyl-5-oxy]propionate; M&B 39279), pyridine derivatives 
(e.g. LS 82-556), and phenopylate and its Ophenylpyrrolidino- and piperidinocari^amate 
analogs. 

The diphenylethers of particular significance are those having the general fomnula 




(Formula I) 



wherein R equals -COONa (Fonrtula II), -CONHSOzCHa (Formula III) or -COOCH2COOC2H5 
(Formula IV; see Maigrot etal., Brighton Crop Protection Conference-Weeds: Al-S^ (1989)), 
Additional diphenylethers of interest are those where R equals: 

NOCH2COOCH3 
CCH2OCH3 

(Fonmula IVa; see Hayashi ef a/., Brighton Crop Protection Conference-Weeds: 53-58 
(1989)). 

An additional diphenylether of interest is one having the formula: 

COOCH3 




CI 



(Formula IVb; bifenox, seeDest etal., Proc. Northeast Weed Sci. Conf. 27:31 (1973)). 
A further diphenylether of interest is one having the fomnula: 
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OCH2CH3 




(Formula IVc; oxyfluorfen; see Yih and Swithenbank, J. Agric. Food Chem., 23: 592 (1975)) 
Yet another diphenylether of interest is one having the fomnula: 

CH3 

CO2CHCO2CH2CH3 




(Fomriula IVd; lactofen, see page 623 of 'The Pesticide l^/ianual", 10*^ ed., ed. by C. Tomlin. 
British Crop Protection Council, Surrey (1994)) 

Also of significance are the class of herbicides known as imides, having the general 
formula 




^ (Formula V) 



wherein Q equals 
O 



ir >- 

o (Formula VI) 



or 



N- (Fonnula VII) 



or 



S3 
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o (Formula VIII) 

or 

T 'n— 

HRC II 

o (Formula IX) 



or 



CI 




CH3 



CF3 

(Formula IXa) 



or 



CI 




OCHF2 



N 

(Formula IXb) 

(see Hamper et al. (1995) in "Proceedings of the Eighth Intemational Congress of Pesticide 
Chemistry", Ragdale etai, eds., Amer. Chem. Soc, Washington, D.C., pp.42-48 (1994)); 
and Ri equals H, CI or F, Ra equals CI and R3 is an optimally substituted ether, thioether, 
ester, amino or allcyl group. Altematively, R2 and R3 together may form a 5 or 6 membered 
heterocyclic ring. Examples of imide herbicides of particular interest are 

R 




O 

(Formula Vila; fiuthiacet-methyl, see Miyazawa ef a/., Brighton Crop Protection Conference- 
Weeds, pp. 23-28 (1993)) 
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ChbS02NH (Formula X sulfentrazone, see Van Saun ef a/„ 

Brighton Crop Protection Conference-Weeds, pp. 77-82 (1991)). 




"^^N^ "OCHF2 



(Formula XII) 
(see Miura ef a/., Brighton Crop Protection 
Conference-Weeds: 35-40 (1993)) 



» S F 

"schfecoochfe (Formula XIII) 




o 

0^ 



O F 

6 0Chl2C00C%H„ (FomiulaXIV) 
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(Formula XVI) 



The herbicidal activity of the above compounds is described in the Proceedings of 
the 1991 Brighton Crop Protection Conference, Weeds (British Crop Protection Council) 
(FomDulae X and XVI), Proceedings of the 1993 Brighton Crop Protection Conference, 
Weeds (British Crop Protection Council) (Formulae XII and XIII), U.S. Patent No. 4.746,352 
(Fomiula XI) and Abstracts of the Weed Science Society of America vol. 33, pg. 9 
(1993)(Formula XIV). 

The most preferred imide heriDicides are those classified as aryluracils and having 
the general fonnula 




COOR (Formula XVII) 
wherein R signifies the group (C2-6-all<enyloxy)carbonyI-Ci^-alkyl, as disclosed in U.S. 
Patent No. 5,183,492, herein incorporated by reference. 

Also of significance are heriDicides having the general formula: 




(Fonnula XVIII; thiadiazimin) 
(see Weiler etal., Brighton Crop Protection Conference- 
Weeds, pp. 29-34 (1993)); 



56 



wo 01/68826 



PCT/EPOl/02753 



CI 



CH3CH2O. 




N-CHF2 



CH3 



(Formula XIX; carfentrazone) 
(see Van Saun et ai, Brighton Crop Protection Conference- 
Weeds: pp. 19-22(1993)); 
N-substituted pyrazoles of the general formula: 




(Formula XX) 

wherein Ri is Ci-C4-alkyl, optionally substituted by one or more halogen atoms; 

R2 is hydrogen, or a CrC4-alkoxy, each of which is optionally substituted by one or 
more halogen atoms, or 

Ri and R2 together from the group -(CH2)n-X-, where X is bound at R2; 
R3 is hydrogen or halogen, 
R4 is hydrogen or Ci-C4-aIkyl, 

R5 is hydrogen, nitro, cyano or the group -COORe or -CONRyRa, and 

Re is hydrogen, CrCs-alkyI, C2-C8-alkenyl or CrCe-alkynyl; 

(see international patent publications WO 94/08999, WO 93/10100, and U. S. 
Patent No. 5,405,829 assigned to Sobering); 
N-phenylpyrazoles, such as: 
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NQ2 



NH2 
.CI 



C 



CF3 



(Formula XXI; nipyraclofen) 



(see page 621 of "The Pesticide Manual", 9th ed., ed. by C.R. Worthing, 



British 



Crop Protection Council, Surrey (1991)); 



and 3-substituted-2-aryl-4,5,6,7-tetrahydroindazoIes (Lyga et al. Pesticide Sci. 42.29-36 
(1994)). 



CHF2 

(Fonmula XXIa; BAY 11340) 

Also of significance are phenylpyrazoles of the type described in WO 96/01254 and 
WO 97/00246, both of which are hereby incorporated by reference. (Formula XXII). 

Also of significance are pyridyl pyrazoles such as the following: 




NO2 



CI 
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F CI 



F CI. 




CGA 374410 



(Formula XXIIIa) 




\ 

NOA 401954 
(Formula XXIIIb) 



Also of significance are plienylpyrazoies having ttie general formula: 




P3C 



(Formula XXIV: Fluazolate) 
{see Prosch, S. D. et at., Brighton Crop Protection Conference- 
Weeds: pp. 45-50 (1997), (Vol. 1)) 

Additional compounds of significance are described in W098/33927 and US patent 
5,856,495, both of which are incorporated herein by reference in their entirety. 

Additional compounds of significance are also, for example, Chlornitrofen (4- 
nitropheny! 2,4,6-trichlorophenylether), Ethoxyfen (ethyl 0-[2-chloro-5-(2-chloro-a,a,a,- 
trifluoro-p-toly!oxy)benzoyl]-L-!actate), Fomriesafen (5-(2-ch!oro-a,a,a-trifluoro-p-tolyloxy)-N- 
methyl-sulfonyl-2-nitro-benzamide), Azafenidin (2-(2,4-dichioro-5-prop-2-ynyloxyphenyl)- 
5,6,7,8-tetrahydro-1 ,2,4-triazolo[4,3-a]pyridin-3-(2H)-one), Butafenacil or Fluobutracil (2-[2- 
chloro-5-(3,6-dihydro-2,6-dioxo-3-methyl-4-trifluoromethyl-1-(2H)-pyrimidinyl)-benzoyloxy]-2- 
methyl-propionic acid allyl ester), Cinidon-ethyl (2-chloro-3-[2-chloro-5-(cyclohex-1-en-1,2- 
dicarboximido)phenyl]acrylic acid ethyl ester), Fluthiacet-methyl (methyl[2-chloro-4-fluoro-5- 
(5,6,7,8-tetrahydro-3-oxo-1 H,3H-[1 ,3,4]thiadiazolo[3,4 a] pyridazin-1 -ylideneamino) 
phenylthiojacetate), Oxadiargyl (5-tert.-butyl-3-[2,4-dichIoro-5-(prop-2-ynyloxy)phenyl]- 
1 ,3,4-oxadiazol-2(3H)-one), Oxadiazon (5-tert.-butyl-3-(2,4-dichloro-5-isopropoxyphenyl)- 
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1 ,3,4-oxa-clia2ol-2(3H)-one), Pentoxazone (3-(4-chloro-5-cyclopentyloxy-2-fluorophenyl)-5- 
isopropylidene-1 ,3-oxazolidine-2,4-dione). Pyraf lufen-ethyl (ethyl 2 chloro-5-(4-chloro-5- 
difluoromethoxy-1-methyl-pyrazol-3-yl)-4-fluorophenoxy acetate), Rumipropyn ((+)-N-[4- 
chloro-2-fluoro-5-(1 -methyIprop-2-ynyl)oxy phenyl]cyclohex-1 -ene-1 ,2-dicarboximide). 
Flupropacil (2-chloro-5-(1 ,2,3,6-tetrahydro-3-methyl-2,6-dioxo''4-trif luoromethyl-pyrimidin-'l - 
yl)benzoic acid isopropyl ester), Thidiazimin ((Z)-6-(6,7-dihydro-6,6-dimethyl-3H,5H-pyrrolo-[ 
2J-c]thiadia2ol-3-ylideneamino)-7-fluoro-4-(2-propynyl)-2H-1,4-benzoxazin-3-(4H)-on 
Benzfendizone (methyl 2^ 5-ethyl-2-[4-(1 ,2,3,6-tetrahydro-3-methyl-2,6-dioxo-4- 
trifluoromethyl-pyrimidin-1-yl)phenoxymethyl]phenoxy ^propionate), Pyrazogyl or Pyraclonil 
(1 -(3-chloro-4.5,6,7-tetrahydropyrazolo[1 ,5-a]pyridin-2-yl)-6-(methyl-2-propynylamino)-1 H- 
pyrazole-4-carbonitirile), Profluazole (1-chloro-N-[2-chloro-4-fluoro-5-[(6S, 7aR)-6- 
fluorotetrahydro-1 ,3-dioxo-1 H-pyrrolo[1 ,2-c]imidazol-2(3H)-yl]phenyl] methanesulfonamide). 

An overview of such compounds is also found in Teroxidizing Herbicides'", ed. by 
Springer, Berlin, eds. P. Boger & K. Wakabayashi, 1999. 

Levels of herbicide that normally are inhibitory to the activity of protox include 
application rates known in the art, and that depend partly on external factors such as 
environment, time and method of application. For example, in the case of the imide 
herbicides represented by Fonmulae V through IX, and more particularly those represented 
by Formulae X through XVII, the application rates range from 0.0001 to 10 kg/ha, preferably 
from 0.005 to 2 kg/ha. This dosage rate or concentration of herisicide may be different, 
depending on the desired action and particular compound used, and can be determined by 
methods known in the art. 

A further object of the invention is a method for controlling the growth of undesired 
vegetation that comprises applying to a population of the plant selected from a group 
consisting of Arabidopsis, sugar cane, soybean, bariey, cotton, tobacco, sugar beet, oilseed 
rape, maize, wheat, sorghum, rye, oats, turf and forage grasses , millet, forage and rice and 
the like an effective amount of a protox-inhibiting heriDicide. Preferred is a method for 
controlling the growth of undesired vegetation, which comprises applying to a population of 
the selected from the group consisting of selected from the group consisting of soybean, 
cotton, tobacco, sugar beet, oilseed rape, maize, wheat, sorghum, rye, oats, turf grasses 
and rice an effective amount of a protox-inhibiting herbicide. Particularly preferred is a 
method for controlling the growth of undesired vegetation, which comprises applying to a 
population of the selected from the group consisting of Arabidopsis, soybean, cotton, sugar 
beet, oilseed rape, maize, wheat, sorghum, and rice. 
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III. Plastid Transformation and Expression 

The present invention further encompasses a chimeric gene comprising a promoter 
capable of expression in a plant plastid operatively linked to a DNA molecule of the present 
invention. A preferred promoter capable of expression in a plant plastid is a promoter 
isolated from the 5' flanking region upstream of the coding region of a plastid gene, which 
may come from the same or a different species, and the native product of which is typically 
found in a majority of plastid types including those present in non-green tissues. Examples 
of such promoters are promoters of c/pP genes, such as the tobacco c/pP gene promoter 
(WO 97/06250, incorporated herein by reference) and the Arabidopsis c/pP gene promoter 
(U.S. Application No. 09/038,878, incorporated herein by reference). Other promoters that 
are capable of expressing a DNA molecule in plant plastids are promoters recognized by 
viral RNA polymerases. Preferred promoters of this type are promoters recognized by a 
single sub-unit RNA polymerase, such as the T7 gene 10 promoter, which Is recognized by 
the bacteriophage T7 DNA-dependent RNA polymerase. Yet another promoter that is 
capable of expressing a DNA molecule in plant plastids comes from the regulatory region of 
the plastid 16S ribosomal RNA operon (Hams ef a/., Microbiol, Rev. 58:700-754 (1994), 
Shinozaki et at., EMBO J. 5:2043-2049 (1986), both of which are incorporated herein by 
reference). The gene encoding the T7 polymerase is preferably transfonned into the 
nuclear genome and the T7 polymerase is targeted to the plastids using a plastid transit 
peptide. Expression of the DNA molecules in the plastids can be constitutive or can be 
inducible. These different embodiment are extensively described in WO 98/11235, 
incorporated herein by reference. The chimeric gene preferably further comprises a 5* 
untranslated sequence (5' UTR) functional in plant plastids and a plastid gene 3' 
untranslated sequence (3' UTR) operatively linked to a DNA molecule of the present 
Invention. Preferably, the 3* UTR is a plastid rps\Q gene 3' untranslated sequence. In a 
further embodiment, the chimeric gene comprises a poly-G tract instead of a 3' untranslated 
sequence. 

The present invention also encompasses a plastid transfomiation vector comprising 
the chimeric gene described above and flanking regions for integration into the plastid 
genome by homologous recombination. The plastid transformation vector may optionally 
comprise at least one chloroplast origin of replication. The present invention also 
encompasses a plant plastid transfomied with such a plastid transfonmation vector, wherein 
the DNA molecule is expressible in the plant plastid. The invention also encompasses a 
plant or plant cell, including the progeny thereof, comprising this plant plastid. In a prefenred 
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embodiment, the plant is homoplasmic for transgenic plastids. The plants transformed in the 
present invention may be monocots or dicots. A preferred monocot is maize and a 
prefen^ed dicot is tobacco. Other prefenred dicots are tomato and potato. 

In a preferred embodiment, the present invention encompasses a chimeric gene 
comprising a promoter capable of expression in a plant plastid operatively linked to a DNA 
molecule isolated from a prokaryote or a eukaryote that encodes a native or modified protox 
enzyme, such as a DNA molecule that encodes a native or modified wheat, soybean, 
cotton, sugar beet, oilseed rape, rice, sorghum, or sugar cane protox enzyme. Such a DNA 
molecule is comprised in a plastid transformation vector as described above and plants 
homoplasmic for transgenic plastid genomes are produced. Expression in plant plastids of a 
DNA molecule that encodes a modified protox enzyme preferably confers upon the plant 
tolerance to a herbicide in amounts that inhibit naturally occurring protox activity. 

In a further pref en-ed embodiment, the present invention encompasses a chimeric 
gene comprising (a) a DNA molecule isolated from a plant, which in its native state encodes 
a polypeptide that comprises a plastid transit peptide, and a mature enzyme that is natively 
targeted to a plastid of the plant by the plastid transit peptide, wherein the DNA molecule Is 
modified such that it does not encode a functional plastid transit peptide; and (b) a promoter 
capable of expressing the DNA molecule in a plastid, wherein the promoter Is operatively 
linked to the DNA molecule. In one preferred embodiment, the transit peptide is mutated 
and thus does not allow the proper transport of the enzyme encoded by the DNA molecule 
to the desired cell compartment, such as the plastid. In another preferred embodiment, a 
portion of the transit peptide coding sequence or the entire transit peptide coding sequence 
is removed from the DNA molecule, preventing the enzyme from being properly targeted to 
the desired cell compartment. 

The chimeric genes described above are inserted in plastid transformation vectors, 
and the present invention is therefore also directed to plants having their plastid genome 
transfomned with such vectors, whereby the DNA molecule Is expressible in plant plastids. 
Such plants are preferably homoplasmic for transgenic plastids. 

In a preferred embodiment, a DNA molecule described immediately above encodes 
an enzyme that in its wild-type form is inhibited by a hertDicide. In a further preferred 
embodiment, the DNA molecule encodes an enzyme that in its wild-type form is inhibited by 
a heri3icide, but that comprises at least one amino acid change compared to the wild-type 
enzyme. Such an amino acid change makes the enzyme resistant to compounds that 
naturally inhibit the wild-type enzyme. In a further preferred embodiment, the DNA molecule 
encodes an enzyme having protoporphyrinogen oxidase (protox) activity. In a further 
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preferred embodiment, the transit peptide is removed from the DNA molecule as further 
illustrated in Examples 37-42. Plants homoplasmic for transgenic plastids of the invention 
are resistant to high amounts of herbicides such as Fonmula XVII that inhibit the naturally 
occumng protox activity (as further illustrated in Example 44). 

In another preferred embodiment, the transit peptide of a DNA molecule encoding a 
5-enolpyruvyl-3-phosphoshikimate synthase (EPSP synthase) is mutated or removed. The 
resulting DNA molecule is fused to a promoter capable of expression in plant plastids and 
homoplasmic plants harboring such constructs in their plastid genomes are obtained. These 
plants are resistant to herbicldal compounds that naturally inhibit EPSP synthase, in 
particular glyphosate. In another preferred embodiment, the transit peptide of a DNA 
molecule encoding a acetolactate synthase (ALS) is mutated or removed. The resulting 
DNA molecule is fused to a promoter capable of expression in plant plastids and 
homoplasmic plants harboring such constructs in their plastid genome are obtained. These 
plants are resistant to herbicldal compounds that naturally inhibit ALS, in particular 
sulfonylureas. In another preferred embodiment, the transit peptide of a DNA molecule 
encoding a acetoxyhydroxyacid synthase (AHAS) Is mutated or removed. The resulting 
DNA molecule is fused to a promoter capable of expression in plant plastids and 
homoplasmic plants harboring such constructs in their plastid genome are obtained. These 
plants are resistant to herbicldal compounds that naturally inhibit AHAS, in particular, 
imidazolinone and sulfonamide herbicides. In another preferred embodiment, the transit 
peptide of a DNA molecule encoding an acetylcoenzyme A carboxylase (ACCase) is 
mutated or removed. The resulting DNA molecule is fused to a promoter capable of 
expression in plant plastids and homoplasmic plants harboring such constructs In their 
plastid genome are obtained. These plants are resistant to herbicldal compounds that 
naturally inhibit ACCase, in particular cyclohexanedione and arylphenoxypropanoic acid 
herbicides. In another prefen^ed embodiment, Oie transit peptide of a DNA molecule 
encoding a glutamine synthase (GS) is mutated or removed. The resulting DNA molecule is 
fused to a promoter capable of expression in plant plastids and homoplasmic plants 
harboring such constmcts in their plastid genome are obtained. These plants are resistant 
to herbicldal compounds that naturally inhibit GS, in particular phosphinothricin and 
methionine sulfoximine. 

The present invention is also further directed to a method of obtaining herbicide- 
resistant plants by transforming their plastid genome with a chimeric gene comprising (a) a 
DNA molecule Isolated from a plant, which in its native state encodes a polypeptide that 
comprises a plastid transit peptide, and a mature enzyme that is natively targeted to a 
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plastid of the plant by the plastid transit peptide, wherein the DMA molecule is modified 
such that it does not encode a functional plastid transit peptide; and (b) a promoter capable 
of expressing the DNA molecule in a plastid, wherein the promoter is operatively linked to 
the DNA molecule. Examples of enzymes that arq used in the present invention are cited 
immediately above, but the applicability of such a method is not limited to the cited 
examples. 

The present invention is still further directed to a novel method for selecting a 
transplastomic plant cell, comprising the steps of: introducing the above-described chimeric 
gene into the plastome of a plant cell; expressing the encoded enzyme in the plastids of the 
plant cell; and selecting a cell that is resistant to a herbicldal compound that naturally 
inhibits the activity of the enzyme, whereby the resistant cell comprises transformed 
plastids. In a preferred embodiment, the enzyme is naturally inhibited by a herbicidal 
compound and the transgenic plant is able to grow on an amount of the herbicidal 
compound that naturally inhibits the activity of the enzyme. In a further preferred 
embodiment, the enzyme has protoporphyrinogen oxidase (protox) activity and Is modified 
so that it that confers resistance to protox inhibitors. 

A further aspect of the present invention is a novel method for plastid transformation 
of recalcitrant plants. The methods pioneered for plastid transformation of tobacco and 
lower plant species rely on non-lethal selection for resistance to antibiotics that 
preferentially affect the plastid translational apparatus and hence allow photo-heterotrophic 
transformants to outgrow heterotrophic, non-transformed tissue. 

Several factors have likely contributed to the difficulties encountered with plastid 
transfomnation of monocots and other dicots. For example, the maize chloroplast 16S 
ribosomal RNA (rRNA) is naturally resistant to spectinomycin because of the presence of a 
G at position 1 1 38 in the Zea mays 1 6S rDNA gene (Hams ef a/., 1 994). Thus, utilization of 
16s rRNA point mutations that confer spectinomycin and/or streptomycin resistance which 
have been used successfully as selectable chloroplast markers in Chlamydomonas ar\6 
tobacco (Boynton and Gillham (1993) /n Wu, R. [Ed.] Methods in Enzymology\lo\ 217. 
Academic Press, San Diego, pp. 510-536; Svab etaL (1990) Proc. Natl. Acad. Sci. U.S.A. 
87: 8526-8530) is not feasible for maize. Natural spectinomycin and streptomycin 
resistance in maize also obviates the use of the bacterial aadA gene encoding 
aminoglycoside 3"-adenyltransferase, which results in dominant spectinomycin and 
streptomycin resistance and allows a 100-fold increase in tobacco chloroplast 
transformation efficiency (Svab and Maliga (1993) Proc. Natl. Acad. Sci. U.S.A. 90: 913- 
917). Use of kanamycin (the only other antibiotic proven to be useful for chloroplast 
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transformation) is also problematic due to a large excess (ca. 50:1) of nuclear vs. 
chloroplast-encoded resistance in tobacco following bombardment of the bacterial nptll 
gene encoding neomycin phosphotransferase (Carrer etaL (1993) MoL Gen, Genet 241: 
49-56). This has been shown to result from both a high frequency of spontaneous nuclear 
resistance mutants as well as integration of nptll into the nuclear genome. Since nptll is 
also a highly effective selectable marker for maize nuclear transfonmation it is reasonable to 
expect similar background levels to that obsen/ed in tobacco. Spontaneous resistance and 
a significant excess of selectable marker integration by random, Illegitimate recombination 
into the nuclear genome, rather than homologous integration into the chloroplast genome, 
would make recovery of bona fide chloroplast transfonnants difficult if not impossible. 

A more fundamental reason for the difficulties encountered with plastid 
transfomiation in plant species other than tobacco may have to do with the non- 
photosynthetic nature of many regenerable cultured plant tissues, especially in maize and 
Arabldopsis. Tobacco is an exception in that cultured vegetative tissues are regenerable 
and contain mature differentiated chloroplasts that are photosynthetically competent in the 
presence of sucrose. Consequently, the cun^ent system for selecting tobacco plastid 
transfomnants relies on the faster growth rate of transfonmed cells that can use both 
reduced and inorganic carbon sources. Moreover, transfomried cells do not suffer the 
chloroplast membrane damage that results from inhibition of plastid protein synthesis in the 
light. This expression of selectable markers that act preferentially on photosynthetic cells, 
driven by promoters that have high activity in differentiated chloroplasts, is unlikely to work 
in non-green tissues containing proplastids (e.g. dark-grown maize Type I callus, somatic 
embryos) or amyioplasts/leucoplasts (e.g. Arabidopsis root cultures). Plastid transformation 
in these plants requires a selectable marker that gives strong selection in all plastid types. 

A preferred selectable marker for generalized plastid transformation: (1) is active 
only in the plastid to eliminate nuclear-transformed "escapes"; (2) has a mode of action that 
does not depend on photosynthetic competence or the presence of fully differentiated 
chloroplasts; and (3) has a level of resistance that is co-dependent on an adjustable 
external parameter (e.g. light), rather than being determined solely by the bulk 
concentration of a selective agent, so that selection pressure can vary during selection to 
facilitate segregation of the many-thousand plastid genome copies. 

In a preferred embodiment, such a selectable marker gene involves the use of a 
chimeric gene comprising an isolated DNA molecule encoding a plastid-targeted enzyme 
having in its natural state a plastid transit peptide, wherein the DNA molecule is modified 
such that the transit peptide either is absent or does not function to target the enzyme to the 
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plastid, wherein the DNA molecule is operatively linked to a promoter capable of expression 
in plant plastids. In a pretended embodiment, a DNA molecule of the present invention 
encodes an enzyme that is naturally inhibited by a herbicide. In another preferred 
embodiment, the DNA molecule encodes a protoporphyrinogen IX oxidase ("protox"). In a 
preferred embodiment, the protoporphyrinogen IX oxidase gene is from Arabidopsis 
thaliana and in a more prefen^ed embodiment, the protoporphyrinogen IX oxidase gene is 
from Arabidopsis thaliana and comprises at least one amino acid substitution. Preferably, 
an amino acid substitution results In tolerance of the enzyme against inhibition by an 
herbicide which naturally inhibits the activity of the enzyme. Low concentrations of herbicide 
are thought to kill wildtype plants due to light-sensitive intennediates which build up when 
the plastid-localized protox enzyme is inhibited. Production of these photosensitizing 
compounds does not require differentiated chloroplasts or active photosynthesis, which is a 
key factor for successful plastid transformation of plants whose regenerable cultured tissues 
are of non-photosynthetic nature. 

Another key feature is to have expression of the selectable marker gene in non- 
green plastids. In a prefen-ed embodiment, the invention encompasses the use of 
promoters that are capable of expression of operatively linked DNA molecules in plastids of 
both green and non-green tissue. In particular, one such promoter comes from the 
regulatory region of the plastid 16S ribosomal RNA operon. Another candidate is the 
promoter and 5' UTR from the plastid cIpP gene. The cIpP gene product is expressed 
constitutively in plastids from all plant tissues, including those that do not contain 
chloroplasts (Shanklin (1 995) Plant Cell 7: 1 71 3-22). 

Other DNA molecules may be co-introduced in plant plastids using the method 
described above. In a pretended embodiment, a plastid transfomnation vector of the present 
invention contains a chimeric gene allowing for selection of transformants as described 
above and at least one other gene fused to a promoter capable of expression in plant 
plastids. The other such gene may, for example, confer resistance to insect pests, or to 
fungal or bacterial pathogens, or may encode one or more value-added traits. 



The invention will be further described by reference to the follov\ring detailed 
examples. These examples are provided for purposes of illustration only, and are not 
intended to be limiting unless othenA^ise specified. Standard recombinant DNA and 
molecular cloning techniques used here are well known in the art and are described by 
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Ausubel (ed.), Current Protocols in Molecular Biology, John Wiley and Sons, Inc. (1994); T. 
Maniatis, E. F. Fritsch and J. Sambrook, Molecular Cloning: A Laboratory Manual, Cold 
Spring HariDor laboratory. Cold Spring Haribor, NY (1989); and by T.J. Silhavy, M.L. Bennan. 
and L.W. Enquist, Experiments with Gene Fusions, Cold Spring Hari^or Laboratory, Cold 
Spring HariDor, NY (1984). 

Section A. Isolation And Characterization Of Plant 
Protoporphyrinogen Oxidase (Protox) Genes 

Example 1: Isolation of a Wheat Protox-1 cDNA Based on Sequence Homology to a Maize 

Protox-1 Coding Sequence 

Total RNA prepared from Triticum aestivum (cv Kanzler) was submitted to Clontech 
for custom cDNA library construction in the Lambda Uni-Zap vector. Approximately 50,000 
pfu of the cDNA library were plated at a density of approximately 5,000 pfu per 10 cm Petri 
dish and duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and 
Schuell). The plaque lifts were probed with the maize protox-1 cDNA (SEQ ID N0:5; see 
Example 2 of Intemational application no. PCT/IB95/00452, filed June 8, 1995. published 
Dec. 21, 1995 as WO 95/34659) labeled with 32P-dCTP by the random priming method 
(Life Technologies). Hybridization conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M 
NaP04 pH 7.0, 1 mM EDTA at 50°C. Wash conditions were 2X SSC, 1% SDS at 50**C. 
(Church and Gilbert, Proa Natl. Acad. Sci. USA 87; 1991-1995 (1984), hereby incorporated 
by reference in its entirety.) Positively hybridizing plaques were purified and in vivo excised 
Into pBluescript plasmids. The sequences of the cDNA inserts were detenmined by the 
chain termination method using dideoxy terminators labeled with fluorescent dyes (Applied 
Biosystems, Inc.). The longest wheat protox-1 cDNA obtained from initial screening efforts, 
designated "wheat protox-1", was 1489-bp in length. Wheat protox-1 lacks coding 
sequence for the transit peptide plus approximately 126 amino acids of the mature coding 
sequence based on comparison with the other known plant protox peptide sequences. 

A second screen was perfomned to obtain a longer wheat protox cDNA. For this 
screen, a Triticum aestivum (cv Kanzler) cDNA library was prepared intemally using the 
lambda Uni-Zap vector. Approximately 200,000 pfu of the cDNA library was screened as 
indicated above, except that the wheat protox-1 cDNA was used as a probe and 
hybridization and wash conditions were at 65°C instead of 50**C. The longest wheat cDNA 
obtained from this screening effort, designated "wheat protox-1 a", was 1811 -bp in length. 
The nucleotide sequence of this cDNA and the amino acid sequence it encodes are set 
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forth in SEQ ID N0s;9 and 10, respectively. Based on comparison with the other known 
plant protox peptide sequences and with conresponding genomic sequence, this cDNA is 
either full-length or missing only a few transit peptide codons (Table 1 A). This wheat 
protein sequence is 91% identical (95% similar) to the maize protox-1 protein sequence set 
forth in SEQ ID NO:6. 

Wheat protox-1 a, In the pBluescript SK vector, was deposited March 19, 1996, as 
pWDC-13 (NRRL #821545). 

Example 2: Isolation of a Soybean Protox-1 cDNA Based on Sequence Homology to an 

Arabldopsis Protox-1 Coding Sequence 
A Lambda Uni-Zap cDNA library prepared from soybean (v Williams 82, epicotyls) 
was purchased from Stratagene. Approximately 50,000 pfu of the library was plated at a 
density of approximately 5,000 pfu per 10 cm Petri dish and duplicate filter lifts were made 
onto Colony/Plaque Screen membranes (NEN Dupont). The plaque lifts were probed with 
the i4ra/)/c/ops/s protox-1 cDNA (SEQ ID N0:1; see Example 1 of Intemational application 
no. PCT/IB95/00452, filed June 8, 1995, published Dec. 21, 1995 as WO 95/34659)) 
labeled with 32P-dCTP by the random priming method (Life Technologies). Hybridization 
conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 
50** C. Wash conditions were 2X SSC, 1% SDS at 50*' C. (Church and Gilbert (1984)). 
Positively hybridizing plaques were purified and in vivo excised into pBluescript plasmids. 
The sequence of the cDNA inserts was determined by the chain termination method using 
dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). The longest 
soybean cDNA obtained, designated "soybean protox-1", is full-length based on comparison 
with the other known plant protox peptide sequences (Table 1 A). Soybean protox-1 is 
1847-bp in length and encodes a protein of 58.8 kDa. The nucleotide sequence of this 
cDNA and the amino acid sequence it encodes are set forth in SEQ ID N0s:1 1 and 12, 
respectively. The soybean protein is 78% identical (87% similar) to the Arabldopsis protox- 
1 protein. 

Soybean protox-1, in the pBluescript SK vector, was deposited December 15, 1995 
as pWDC-12 (NRRL #B-21516). 

Example 3: Isolation of a Cotton Protox-1 cDNA Based on Sequence Homology to a Maize 

Protox-1 Coding Sequence 
A Lambda Uni-Zap cDNA library prepared from Gossypium hirsutum L (72 hr. dark 
grown cotyledons) was obtained from Dr. Dick Trelease, Dept. of Botany, Arizona State 
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University (Ni W. and Trelease R.N., Arch. Biochem. Biophys. 289: 237-243 (1991)). 
Approximately 50,000 pf u of the library was plated at a density of approximately 5,000 pfu 
per 10 cm Petri dish and duplicate filter lifts were made onto Colony/Plaque Screen 
membranes (NEN Dupont). The plaque lifts were probed with the maize protox-1 cDNA 
(SEQ ID N0:5) labeled with 32P-dCTP by the random priming method (Life Technologies). 
Hybridization conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50° C. Wash conditions were 2X SSC, 1% SDS at So"* C. (Church and Gilbert 
(1984)). Positively hybridizing plaques were purified and in vivo excised into pBluescript 
plasmids. The sequence of the cDNA inserts was detemnined by the chain tennination 
method using dideoxy temninators labeled with fluorescent dyes (Applied Biosystems, Inc.). 
The longest cotton cDNA obtained, designated "cotton protox-1", appears to be full-length 
based on comparison with the other known plant protox peptide sequences (Table 1 A). 
Cotton protox-1 is 1826-bp in length and encodes a protein of 58.2 kDa. The nucleotide 
sequence of this cDNA and the amino acid sequence it encodes are set forth in SEQ ID 
N0s:13 and 14, respectively. The cotton protein is 77% identical (86% similar) to the maize 
protox-1 protein. 

Cotton protox-1 , in the pBluescript SK vector, was deposited July 1, 1996 as pWDC- 
15(NRRL #B-21594). 

Example 4: Isolation of a Sugar Beet Protox-1 cDNA Based on Sequence Homology to an 

Arabldopsis Protox-1 Coding Sequence 
A Lambda-Zap cDNA library prepared from Beta vulgaris was obtained from Dr. 
Philip Rea. Dept. of Botany, Plant Science Institute, Philadelphia, PA (Yongcheol Kim, 
Eugene J. Kim, and Philip A. Rea, Plant Physiol. 106: 375-382 (1994)). Approximately 
50,000 pfu of the cDNA library were plated at a density of approximately 5,000 pfu per 10 
cm Petri dish and duplicate filter lifts were made onto nitrocellulose membranes (Schleicher 
and Schuell). The plaque lifts were probed with the Arabldopsis protox-1 cDNA (SEQ ID 
N0:1) labeled with 32P-dCTP by the random priming method (Life Technologies). 
Hybridization conditions were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 
mM EDTA at 50'' C. Wash conditions were 2X SSC, 1 % SDS at 50** C. (Church and Gilbert 
(1984)). Positively hybridizing plaques were purified and in vivo excised Into pBluescript 
plasmids. The sequences of the cDNA inserts were detemnined by the chain tenmination 
method using dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). 
The longest sugar beet protox-1 cDNA obtained, designated "sugar beet protox-1 is full- 
length based on comparison with the other known plant protox peptide sequences (Table 
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1 A). Sugar beet protox-1 is 1 91 0-bp in length and encodes a protein of 60 kDa. The 
nucleotide sequence of this cDNA and the amino acid sequence it encodes are set forth in 
SEQ ID N0s:15 and 16, respectively. The sugar beet protein is 73% identical (82% similar) 
to the Arabidopsis protox-1 protein. 

Sugar beet protox-1 , in the pBluescript SK vector, was deposited July 29, 1996, as 
pWDC-1 6 (NRRL #B-21 595N). 

Example 5: Isolation of an Oilseed Rape Protox-1 cDNA Based on Sequence Homology to 

an Arabidopsis Protox-1 Coding Sequence 
A Lambda Uni-Zap II cDNA library prepared from Brassica napus (3-4 wk. mature 
green leaves) was obtained from Dr. Guenther Ochs, Institut Fuer Allgemeine Botanik, 
Johannes Qutenberg-Universitaet Mainz, Gemiany (Giinther Ochs. Gerald Schock, and 
Aloysius Wild, Plant Physiol 103: 303-304 (1993)). Approximately 50,000 pfu of the cDNA 
library were plated at a density of approximately 5,000 pfu per 10 cm Petri dish and 
duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and Schuell). The 
plaque lifts were probed with the Arabidopsis protox-1 cDNA (SEQ ID N0:1) labeled with 
32P-dCTP by the random priming method (Life Technologies). Hybridization conditions 
were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50*" C. Wash 
conditions were 2X SSC, 1% SDS at 50** C. (Church and Gilbert (1984)). Positively 
hybridizing plaques were purified and in vivo excised into pBluescript plasmids. The 
sequences of the cDNA Inserts were detemiined by the chain tenmination method using 
dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). The longest 
oilseed rape protox-1 cDNA obtained, designated "rape protox-1 is full-length based on 
comparison with the other known plant protox peptide sequences (Table 1 A). Rape protox- 
1 is 1784-bp in length and encodes a protein of 57.3kD. The nucleotide sequence of this 
cDNA and the amino acid sequence it encodes are set forth In SEQ ID NOs: 17 and 1 8, 
respectively. The oilseed rape protein is 87% identical (92% similar) to the Arabidopsis 
protox-1 protein. 

Rape protox-1 , In the pBluescript SK vector, was deposited August 23, 1996, as 
pWDC-17 (NRRL #B-21615). 

Example 6: Isolation of a Rice Protox-1 cDNA Based on Sequence Homology to a Maize 

Protox-1 Coding Sequence 
A Lambda gtl 1 cDNA library prepared from Oryza satlva (5 day etiolated shoots) 
was purchased from Clontech. Approximately 50,000 pfu of the cDNA library were plated at 
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a density of approximately 5,000 pfu per 10 cm Petri dish and duplicate filter lifts were made 
onto nitrocellulose membranes (Schleicher and Schuell). The plaque lifts were probed with 
the maize protox-1 cDNA (SEQ ID NO:5) labeled with 32P-dCTP by the random priming 
method (Life Technologies). Hybridization conditions were 7% sodium dodecy! sulfate 
(SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50° C. Wash conditions were 2X SSC, 1% 
SDSatSO C. (Church and Gilbert (1984)). Positively hybridizing plaques were purified, 
and lambda DNA was prepared using the Wizard Lambda-Prep kit (Promega). The cDNA 
inserts were subcloned as EcoR/ fragments into the pBluescript SK vector using standard 
techniques. The sequences of the cDNA inserts were detennined by the chain termination 
method using dideoxy temiinators labeled with fluorescent dyes (Applied Biosystems, Inc.). 
The longest rice protox-1 cDNA obtained, designated "rice protox-r, was 1224-bp in length. 
Rice protox-1 lacks coding sequence for the transit peptide plus approximately 172 amino 
acids of the mature coding sequence based on comparison with the other known plant 
protox peptide sequences (Table 1 A). The nucleotide sequence of this partial cDNA and 
the amino acid sequence it encodes are set forth in SEQ ID NOs:19 and 20, respectively. 

Rice protox-1 , in the pBluescript SK vector, was deposited December 6, 1 996, as 
pWDC-18 (NRRL #B-21648). 

Example 7: Isolation of a Sorghum Protox-1 cDNA Based on Sequence Homology to a 

Maize Protox-1 Coding Sequence 
A Lambda-Zap II cDNA library prepared from Sorghum bicolor (3-6 day green 
seedlings) was obtained from Dr. Klaus Pf izenmaier, Institute of Cell Biology and 
Immunology, University of Stuttgart. Genmany (Harald Wajant, Kari-Wolfgang Mundry, and 
Klaus Pfizenmaier, Plant MoL Biol. 26: 735-746 (1994)). Approximately 50,000 pfu of the 
cDNA library were plated at a density of approximately 5,000 pfu per 10 cm Petri dish and 
duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and Schuell). The 
plaque lifts were probed with the maize protox-1 cDNA (SEQ ID NO:5) labeled with 32P- 
dCTP by the random priming method (Life Technologies). Hybridization conditions were 7% 
sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50° C. Wash 
conditions were 2X SSC, 1% SDS at 50° C. (Church and Gilbert (1984)). Positively 
hybridizing plaques were purified and in vivo excised into pBluescript plasmids. The 
sequences of the cDNA inserts were detennined by the chain temnination method using 
dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). The longest 
sorghum protox-1 cDNA obtained, designated "sorghum protox-r, was 1590-bp in length. 
Sorghum protox-1 lacks coding sequence for the transit peptide plus approximately 44 



71 



wo 01/68826 



PCT/EPOl/02753 



amino acids of the mature coding sequence based on comparison with the other l<nown 
plant protox peptide sequences (Table 1A). The nucleotide sequence of this partial cDNA 
and the amino acid sequence it encodes are set forth in SEQ ID N0s:21 and 22, 
respectively. 

Sorghum protox-1 Jn the pBluescript SK vector, was deposited December 6, 1996, 
as pWDC-19 (NRRL #B-21649). 

Example 8: Isolation of a Sugar Cane Protox-1 cDNA Based on Sequence Homology to a 

Maize Protox*1 Coding Sequence 
A Lambda-Zap II cDNA library prepared from sugar cane was obtained from Henrik 
Albert of USDA/ARS at the Hawaii Agricultural Research Center. Approximately 50,000 pf u 
of the cDNA library were plated at a density of approximately 5,000 pfu per 10 cm Petri dish 
and duplicate filter lifts were made onto nitrocellulose membranes (Schleicher and Schuell). 
The plaque lifts were probed with the maize protox-1 cDNA (SEQ ID NO:5) labeled with 
32P-dCTP by the random priming method (Life Technologies). Hybridization conditions 
were 7% sodium dodecyl sulfate (SDS), 0.5 M NaP04 pH 7.0, 1 mM EDTA at 50** C. Wash 
conditions were 2X BSC, 1% SDS at 50*" C. (Church and Gilbert (1984)). Positively 
hybridizing plaques were purified and in vivo excised into pBluescript plasmids. The 
sequences of the cD^4A inserts were determined by the chain temnination method using 
dideoxy terminators labeled with fluorescent dyes (Applied Biosystems, Inc.). The longest 
sugar cane protox-1 cDNA obtained, designated "sugar cane protox-1", was 633-bp In 
length. Sugar cane protox-1 lacks coding sequence for the transit peptide plus 
approximately 382 amino acids of the mature coding sequence based on comparison with 
the other known plant protox peptide sequences (Table 1 A). The nucleotide sequence of 
this partial cDNA and the amino acid sequence it encodes are set forth in SEQ ID NOs:36 
and 37, respectively. 

Example 9: Demonstration of Plant Protox Clone Sensitivity to Protox Inhibitory Herbicides 

in a Bacterial System 
Liquid cultures of protox-1 /SASX38, protox-2/SASX38 and pBluescript/XL1-Blue 
were grown in L amp^**. One hundred microliter aliquots of each culture were plated on L 
amp^^ media containing various concentrations (1 .OnM-IOmM) of a protox inhibitory 
aryluracil herbicide of fom^ula XVII. Duplicate sets of plates were incubated for 1 8 hours at 
37** C. 
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The protox+ E co// strain XL1-Blue showed no sensitivity to the herbicide at any 
concentration, consistent with reported resistance of the native bacterial enzyme to similar 
herbicides. The protox-1/SASX38 was clearly sensitive, with the lawn of bacteria almost 
entirely eliminated by inhibitor concentrations as low as lOnM. The protox-2/SASX38 was 
also sensitive, but only at a higher concentration (lOpiVI) of the herbicide. The herbicide was 
effective even on plates maintained almost entirely in the dark. The toxicity of the herbicide 
was entirely eliminated by the addition of 20 pg/ml hematin to the plates. 

The different herbicide tolerance between the two plant protox strains is likely the 
result of differential expression from these two piasmids, rather than any inherent difference 
in enzyme sensitivity. Protox-1/SASX38 grows much more slowly than protox-2/SASX38 in 
any heme-deficient media. In addition, the Mzprotox-2/SASX38 strain, with a growth rate 
comparable to Arab protox-1/SASX38. is also very sensitive to herbicide at the lower (10- 
lOOnM) concentrations. 

Section B: Identification and Characterization of Plant Protox Genes 
Resistant to Protox-lnhibitory Herbicides 

Example 10: Selecting for Plant Protox Genes Resistant to Protox-lnhibitory Herbicides in 

the E CO// Expression System 
An Arabidopsis thaliana (Landsberg) cDNA library in the plasmid vector pFLGI 
(Minet ef a/.. Plant J. 2:417-422 (1992) was obtained and amplified. The E colihemG 
mutant SASX38 (Sasanman ef a/.. J. Gen. Microbiol. 113:297(1979)) was obtained and 
maintained on L media containing 20ug/ml hematin (United States Biochemicals). The 
plasmid library was transfonned into SASX38 by electroporation using the Bio-Rad Gene 
Pulser and the manufacturer's conditions. The electroporated cells were plated on L agar 
containing 100ug/ml ampicillin at a density of approximately 500,000 transf ormants/1 0cm 
plate. The cells were then incubated at 37°C for 40 hours in low light and selected for the 
ability to grow without the addition of exogenous heme. Heme prototrophs were recovered 
at a frequency of 400/10^ from the pFL61 library. Sequence analysis of twenty-two 
complementing clones showed that nine are of the type designated "protox-1 the protox 
gene expected to express a chloroplastic protox enzyme. 

The pFLBI library is a yeast expression library, with the Arabidopsis cDNAs inserted 
bidirectionally. These cDNAs can also be expressed In bacteria. The protox cDNAs 
apparently initiate at an in-frame ATG in the yeast PGK 3' sequence approximately 10 
amino acids 5* to the NotI cloning site in the vector and are expressed either from the lacZ 
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promoter SOObp further upstream or from an undefined cryptic bacterial promoter. Because 
protox-1 cDNAs tliat included significant portions of a chloroplast transit sequence inhibited 
the growth of the £ co// SASX38 strain, the clone with the least amount of chloroplast 
transit sequence attached was chosen for mutagenesis/herbicide selection experiments. 
This clone, pSLV19, contains only 17 amino acids of the putative chloroplast transit peptide, 
with the DNA sequence beginning at bp-151 of the Arabidopsis protox-1 cDNA (SEQ ID 
N0:1). 

The plasmid pSLV19 was transfonned into the random mutagenesis strain XLI-Red 
(Stratagene, La Jolia, CA). The transfonnation was plated on L media containing 50ug/ml 
ampicillin and incubated for 48 hours at 37^C. Lawns of transfonmed cells were scraped 
from the plates and plasmid DNA prepared using the Wizard Megaprep kit (Promega, 
Madison, Wl). Plasmid DNA isolated from this mutator strain is predicted to contain 
approximately one random base change per 2000 nucleotides (see Greener et al, 
Strategles7{2):32-34 (1994). 

The mutated plasmid DNA was transfonned into the hemG mutant SASX38 
(Sasarman et ai, J. Gen, Microbiol. 1 13:297 (1979) and plated on L media containing 
various concentrations of protox-inhibiting herbicide (formula XVII). The plates were 
incubated for 2 days at 37*^0. Plasmid DNA was isolated from all colonies that grew in the 
presence of herbicide concentrations that effectively killed the wild type strain. The isolated 
DNA was then transfonmed into SASX38 and plated again on herbicide to ensure that the 
resistance obsen/ed was plasmid-bome. The protox coding sequence from plasmids 
passing this screen was excised by Wof/ digestion, recloned into an unmutagenized vector, 
and tested again for the ability to confer herbicide tolerance. The DNA sequence of protox 
cDNAs that confenred herbicide resistance was then determined and mutations identified by 
comparison with the wild type Arabidopsis protox-l sequence (SEQ ID N0:1). 

A single coding sequence mutant was recovered from the first mutagenesis 
experiment. This mutant leads to enhanced herbicide "resistance" only by increasing 
growth rate. It contains a C to A mutation at nucleotide 197 In SEQ ID NO:1 in the 
truncated chloroplast transit sequence of pSLV19, converting an ACG codon for threonine 
to an AAG codon for lysine at amino acid 56 of SEQ ID NO:2, and resulting in better 
complementation of the bacterial mutant This plasmid also contains a silent coding 
sequence mutation at nucleotide 1059, with AGT (Ser) changing to AGO (Ser). This 
plasmid was designated pMut-1. 

The pMut-1 plasmid was then transfonned into the mutator XLI-Red strain as 
described above and the mutated DNA was isolated and plated on an herbicide 
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concentration that is lethal to the unmutagenized pMut-1 protox gene. Herbicide tolerant 
colonies were isolated after two days at 37**C and analyzed as described above. IVIultiple 
plasmids were shown to contain herbicide resistant protox coding sequences. Sequence 
analysis indicated that the resistant genes fell into two classes. One resistance mutation 
identified was a C to T change at nucleotide 689 in the Arabidopsis protox-1 sequence set 
forth in SEQ ID NO:1 . This change converts a GCT codon for alanine at amino acid 220 of 
SEQ ID N0:2 to a GTT codon for valine, and was designated pAraC-1 Val {see, Table 1 B; 
sub-sequence 3). 

A second class of herbicide resistant mutant contains an A to G change at 
nucleotide 1307 in the Arabidopsis protox-1 sequence. This change converts a TAG codon 
for tyrosine at amino acid 426 to a TGC codon for cysteine, and was designated pAraC- 
2Cys {see, Table 1 B; sub-sequence 7). 

A third resistant mutant has a G to A change at nucleotide 691 in the Arabidopsis 
protox-1 sequence. This mutation converts a GGT codon for glycine at amino acid 221 to 
an AGT codon for serine at the codon position adjacent to the mutation in pAraG-1 . This 
plasmid was designated pAraC-3Ser {see, Table 1 B; sub-sequence 4). 

Resistant mutant pAraC-2Cys. in the pMut-1 plasmid. was deposited on November 
14, 1994 under the designation pWDC-7 with the Agricultural Research Culture Collection 
and given the deposit designation NRRL #21339N. 

Example 1 1 : Additional Herbicide-Resistant Codon Substitutions at Positions Identified in 

the Random Screen 

The amino acids identified as herbicide resistance sites in the random screen are 
replaced by other amino acids and tested for function and for herbicide tolerance in the 
bacterial system. Oligonucleotide-directed mutagenesis of the Arabidopsis prot6x-1 
sequence is performed using the Transformer Site-Directed l\/lutagenesis Kit (Clontech, 
Palo Alto, CA). After amino acid changes are confimied by sequence analysis, the mutated 
plasmids are transfomned into SASX38 and plated on L-amp™ media to test for function 
and on various concentrations of protox-inhibiting hertjicide to test for tolerance. 

This procedure is applied to the alanine codon at nucleotides 688-690 and to the 
tyrosine codon at nucleotides 1306-1308 of the Arabidopsis protox-1 sequence (SEQ ID 
NO:1). The results demonstrate that the alanine codon at nucleotides 688-690 can be 
changed to a codon for valine (pAraC-1 Val). threonine (pAraC-IThr), leucine (pAraC-1 Leu), 
cysteine (pAraC-ICys), or isoleucine (pAraC-llle) to yield an herbicide-resistant protox 
enzyme that retains function {see. Table 1 B; sub-sequence 3). The results further 
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demonstrate that the tyrosine codon at nucleotides 1306-1308 can be changed to a codon 
for cysteine (pAraC-2Cys), isoleucine (pAraC-2lle). leucine (pAraC-2Leu), threonine (pAraC- 
2Thr), methionine (pAraC-2Met). valine (pAraC-2Val), or alanine (pAraC-2Ala) to yield an 
herbicide-resistant protox enzyme that retains function {see, Table IB; sub-sequence 7). 

Example 12: Isolation of Additional Mutations that Increase Enzyme Function and/or 
Herbicide Tolerance of Previously Identified Resistant Mutants 

Plasmids containing herbicide resistant protox genes are transfonmed into the 
mutator strain XLI-Red and mutated DNA is isolated as described above. The mutated 
plasmids are transfomied into SASX38 and the transfomnants are screened on herbicide 
concentrations (fonmula XVII) sufficient to Inhibit growth of the original "resistant" mutant. 
Tolerant colonies are isolated and the higher tolerance phenotype is verified as being 
coding sequence dependent as described above. The sequence of these mutants is 
detemnined and mutations identified by comparison to the progenitor sequence. 

This procedure was applied to the pAraC-1 Val mutant described above. The results 
demonstrate that the serine codon at amino acid 305 (SEQ ID NO:2) can be changed to a 
codon for leucine to yield an enzyme with higher tolerance to protox-inhibiting herbicides 
than the pAraC-IVal mutant alone. This second site mutation is designated AraC305Leu 
{see, Table 1 B; sub-sequence 13). The same results are demonstrated for the threonine 
codon at amino acid 249, where a change to either isoleucine or to alanine leads to a more 
tolerant enzyme {see. Table IB; sub-sequence 12). These changes are designated 
AraC249lle and AraC249Ala, respectively. 

The procedure was also applied to the pAraC-2Cys mutant described above. The 
results demonstrate that the proline codon at amino acid 1 1 8 (SEQ ID N0:2) can be 
changed to a codon for leucine to yield an enzyme with higher tolerance to protox-inhibiting 
herbicides than the pAraC-ICys mutant alone. This mutation is designated AraCI 18Leu 
(see. Table 1 B; sub-sequence 11). The same results are demonstrated for the serine 
codon at amino acid 305, where a change to leucine leads to a more tolerant pAraC-2Cys 
enzyme {see, Table IB; sub-sequence 13). This change was also Isolated with the pAraC- 
1 Val mutant as described above and is designated AraC305Leu. Additional mutations that 
enhance the heriDicide resistance of the pAraC-2Cys mutant include an asparagine to 
serine change at amino acid 425, designated AraC425Ser (Table 1 B; sub-sequence 14), 
and a tyrosine to cysteine at amino acid 498, designated AraC4g8Cys (Table IB; sub- 
sequence 15). 
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These changes (Table IB; sub-sequences 11-15) are referred to as "second site" 
mutations, because they are not sufficient to confer herbicide tolerance alone, but rather 
enhance the function and/or the herbicide tolerance of an already mutant enzyme. This 
does not preclude the possibility that other amino acid substitutions at these sites could 
suffice to produce an herbicide tolerant enzyme since exhaustive testing of all possible 
replacements has not been performed. 

Example 13: Combining Identified Resistance Mutations with Identified Second Site 
Mutations to Create Highly Functional/Highly Tolerant Protox Enzymes 
The AraC305Leu mutation described above was found to enhance the 
function/herbicide resistance of both the AraC-1 Val and the AraC-2Cys mutant plasmids. In 
an effort to test the general usefulness of this second site mutation, it was combined with 
the AraC-2Leu, AraC-2Val, and AraC-2lle mutations and tested for herbicide tolerance. In 
each case, the AraCSOSLeu change significantly increased the growth rate of the resistant 
protox mutant on protox-lnhibiting herbicide. Combinations of the AraC-2lle resistant 
mutant with either the second site mutant AraC249lle or AraCI ISLeu also produced more 
highly tolerant mutant protox enzymes. The AraC249lle mutation demonstrates that a 
second site mutation identified as enhancing an AraC-1 (sub-sequence 3) mutant may also 
increase the resistance of an AraC-2 (sub-sequence 7) mutant. A three mutation plasmid 
containing AraC-2lle, AraC305Leu, and AraC249lle (Table IB; sub-sequences 7, 13, and 
12) has also been shown to produce a highly functional, highly herbicide tolerant protox-1 
enzyme. 

Example 14: Identification of Sites in the Maize Protox-1 Gene that Can Be Mutated to Give 

Herbicide Tolerance 

The pMut-1 Arabidopsis protox -1 plasmid described above is very effective when 
used in mutagenesis/screening experiments in that it gives a high frequency of genuine 
coding sequence mutants, as opposed to the frequent up-promoter mutants that are 
isolated when other plasmids are used. In an effort to create an efficient plasmid screening 
system for maize protox-1, the maize cDNA was engineered into the pMut-1 vector in 
approximately the same sequence context as the Arabidopsis cDNA. Using standard 
methods of overlapping PCR fusion, the 5' end of the pMut-1 Arabidopsis clone (including 
17 amino acids of chloroplast transit peptide with one mis-sense mutation as described 
above) was fused to the maize protox-1 cDNA sequence starting at amino acid number 14 
of the maize sequence (SEQ ID N0:6). The 3' end of the maize cDNA was unchanged. 
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NotI restriction sites were placed on both ends of this fusion, and the chimeric gene was 
cloned into the pFL61 plasmid backbone from pMut-1 . Sequence analysis revealed a single 
nucleotide PCR-derived silent mutation that converts the ACQ codon at nucleotides 745- 
747 (SEQ ID NO:5) to an ACT codon, both of which encode threonine. This chimeric Arab- 
maize protox-1 plasmid was designated pMut-3. 

The pMut-3 plasmid was transfomied into the mutator XL1-Red strain as described 
above and the mutated DNA was isolated and plated on a herbicide concentration (formula 
XVII) that was lethal to the unmutagenized pMut-3 maize protox gene. Herbicide tolerant 
colonies were Isolated after two days at 37^C and analyzed as described above. This 
analysis revealed multiple plasmids containing herbicide resistant protox coding sequences. 
Sequence analysis showed 5 single base changes that individually result in an herbicide 
tolerant maize protox-1 enzyme. Three of these mutations con-espond to amino acid 
changes previously shown to confer tolerance at the homologous position in the 
>A/aWdops/s protox-1 gene. Two of the three are pMzC-IVal and pMzC-IThr, converting the 
alanine (GCT) at amino acid 164 (SEQ ID N0:6) to either valine (GAT) or to threonine 
(ACT). This position corresponds to the pAraC-1 mutations described above (see. Table 
1 B; sub-sequence 3). The third analogous change, pMzC-3Ser, converts the glycine (GGT) 
at amino acid 165 to Serine (AGT), con-esponding to the AraC-3Ser mutation described 
above (see, Table 1 B; sub-sequence 4). These results sen/e to validate the expectation 
that herbicide-tolerant mutations Identified in one plant protox gene may also confer 
hert)icide tolerance in an equivalent plant protox gene from another species, 

Two of the mutations isolated from the maize protox-1 screen result in amino acid 
changes at residues not previously identified as herbicide resistance sites. One change 
(iViz159Phe) converts cysteine (TGC) to phenylalanine (TTC) at amino acid 159 of the 
maize protox-1 sequence (SEQ ID N0:6) (see, Table IB; sub-sequence 2). The second 
(Mz419Thr) converts isoleucine (ATA) to threonine (ACA) at amino acid 419 (see. Table IB; 
sub-sequence 9). 

Additional amino acid substitutions were made and tested at three of the maize 
mutant sites. Tolerance was demonstrated when glycine 165 was changed to leucine 
(pMzC-3Leu) or when cysteine 159 was changed to either leucine (Mz159Leu) or to lysine 
(Mz159Lys) (see. Table IB; sub-sequences 4 and 2). Tolerant enzymes were also created 
by changing isoleucine 419 to histidine (Mz419His), glycine (Mz419Gly), or asparagine 
(Mz419Asn) (see, Table IB; sub-sequence 9). 

Individual amino acid changes that produced highly herbicide tolerant Arabidopsis 
protox-1 enzymes were engineered into the maize protox-1 gene by site-directed 
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mutagenesis as described above. Bacterial testing demonstrated that changing the alanine 
(GOT) at amino acid 164 (SEQ ID N0:6) to leucine (CTT) produced a highly tolerant maize 
enzyme (pMzC-1Leu) {see, Table 1 B; sub-sequence 3). No mutation analogous to the 
ArBC-2 site (Table 1B; sub-sequence 7) in ArabldopsisyNas isolated in the maize random 
screen. However, changing this site, tyrosine 370 in the maize enzyme (SEQ ID N0:6), to 
either isoleucine (pMzC-2lle) or methionine (pMzC-2Met) did produce heri:)icide tolerant 
enzymes (see. Table 1B; sub-sequence 7). 

Additional mutant screens performed as described eariier in this example, except 
using formulas XXIIIa and XXIIIb instead of XVII, Identified three additional amino acid 
changes that confer tolerant protox enzymes. One, using formula XXIIIb, demonstrated that 
changing the arginine (CGT) at amino acid 88 (SEQ ID NO:6) to cysteine (TGT) produced a 
highly tolerant maize enzyme (IVIz88Cys) {see, Table IB; sub-sequence 1). Another, using 
formula IVc, demonstrated that changing the arginine (CGT) at amino acid 88 (SEQ ID 
N0:6) to leucine (CTT) produced a highly tolerant maize enzyme (Mz88Cys) (see, Table 1B; 
sub-sequence 1). Another, using fomnula XXIIIa, demonstrated that changing both the 
leucine (TTA) at amino acid 347 (SEQ ID NO:6) to serine (TCA) and the alanine (GCA) at 
amino acid 453 (SEQ ID NO:6) to threonine (ACA) produced a highly tolerant maize 
enzyme (Mz347Ser453Thr) (see, Table IB; sub-sequences 18 and 17). Unlike the second 
site mutations described above, which increase enzyme function and/or heriDicide tolerance 
of previously identified resistant mutants, Mz347Ser453Thr is a "double mutant" that 
requires that both mutations be present for heriDicide tolerance. 

Additional mutant screens performed as described eariier in this example identified 
two additional amino acid changes that confer tolerant protox enzymes. One, using fonmula 
XXIV, demonstrated that changing the alanine (GCA) at amino acid 175 (SEQ ID NO:6) to 
valine (GTA) or to threonine (ACA) produced a highly tolerant maize enzyme (Mz175Val 
and Mz175Thr, respectively) (see, Table IB; sub-sequence 18). Another, using fomiula IVc, 
demonstrated that changing the leucine (TTG) at amino acid 337 (SEQ ID NO:6) to serine 
(TCG) produced a highly tolerant maize enzyme (Mz337Ser) (see, Table IB; sub-sequence 
19). 

Example 15: Identification of Sites in the Wheat Protox-1 Gene that can be Mutated to Give 

HeriDicide Tolerance 

To create an efficient plasmid screening system for wheat protox-1 , the wheat cDNA 
was engineered into the pMut-1 vector as described above for the maize cDNA. This 
chimeric Arab-wheat protox-1 plasmid is designated pMut-4. The pMut-4 DNA was 
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mutated and screened for herbicide tolerance as described above. This analysis revealed 
multiple plasmids containing herbicide resistant protox coding sequences. Sequence 
analysis showed 7 single base changes that individually result in an herbicide tolerant 
wheat protox-1 enzyme. Four of these mutations correspond to amino add changes 
previously shown to confer tolerance at the homologous position in the Arabidopsis and/or 
in the maize protox-1 gene. Two, pWhtC-1 Val and pWhtC-1Thr. convert the alanine (GCT) 
at amino acid 21 1 (SEQ ID NO:10) to valine (GAT) and to threonine (ACT), respectively. 
This position corresponds to the pAraC-1 mutations described above {see, Table IB; sub- 
sequence 3). The third analogous change, pWhtC-3Ser, converts the glycine (GGT) at 
amino acid 212 to serine (AGT), conresponding to the AraC-SSer mutation described above 
{see, Table 1 B; sub-sequence 4). The fourth, Wht466Thr, converts isoleucine (ATA) to 
threonine (ACA) at amino acid 466, corresponding to the Mz419Thr mutant from maize 
(see, Table 1B; sub-sequence 9). 

Three of the mutations Isolated from the wheat protox-1 screen result in amino acid 
changes at residues not previously identified as herbicide resistance sites. One change 
(Wht356Leu) converts valine (GTT) to leucine (CTT) at amino acid 356 of the wheat protox- 
1 sequence (SEQ ID N0:10) {see, Table IB; sub-sequence 6). A second (Wht421Pro) 
converts serine (TCT) to proline (CCT) at amino acid 421 {see, Table IB; sub-sequence 8). 
The third (Wht502Ala) converts valine (GTT) to alanine (GCT) at amino acid 502 {see. 
Table IB; sub-sequence 10). 

Example 16: Identification of Sites in the Soybean Protox-1 Gene that can be Mutated to 

Give Herisicide Tolerance 
To create an efficient plasmid screening system for soybean protox-1, the soybean 
cDNA was engineered Into the pMut-1 vector as described above for the maize cDNA. This 
chimeric Arab-soybean protox-1 plasmid is designated pMut-5. The pMut-5 DNA was 
mutated and screened for heri^iclde tolerance as described above. This analysis revealed 
multiple plasmids containing herbicide resistant protox coding sequences. Sequence 
analysis showed 4 single base changes that individually result in an herbicide tolerant 
soybean protox-1 enzyme. Two of these mutations correspond to amino acid changes 
previously shown to confer tolerance at the homologous position in the Arabidopsis and/or 
in the wheat protox-1 gene. One, pSoyC-IThr, converts the alanine (GCA) at amino acid 
226 (SEQ ID NO:12) to threonine (ACA). This position con^esponds to the pAraC-1Thr 
mutation described above {see. Table IB; sub-sequence 3). The second analogous 
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change, Soy517Ala, converts the valine (GTT) at amino acid 517 to alanine (GCT), 
coresponding to the Wht502Ala mutation from wheat (see. Table 1B; sub-sequence 10). 

Two of the mutations isolated from the soybean protox-1 screen result in amino acid 
changes at a residue not previously identified as an herbicide resistance site. One change 
(Soy369Ser) converts proline (CCT) to serine (TCT) at amino acid 369 of the soybean 
protox-1 sequence (SEQ ID N0:12) {see, Table IB; sub-sequence 5). A second 
(Soy369His) converts this same proline369 to histidine (CAT) (see, Table IB; sub-sequence 
5). 

Individual amino acid changes that produced highly herbicide tolerant >AraWc/ops/s 
protox-1 enzymes were engineered into the soybean protox-1 gene by site directed 
mutagenesis as described above. Bacterial testing demonstrated that changing the alanine 
(GCA) at amino acid 226 (SEQ ID N0:12) to leucine (pSoyC-1Leu) produced a tolerant 
soybean enzyme {see, Table 1 B; sub-sequence 3). Changing the tyrosine (TAC) at amino 
acid 432 (SEQ ID NO:12) to either leucine (pSoyC-2Leu) or isoleucine (pSoyC-2lle) also 
produced herisicide tolerant enzymes {see. Table IB; sub-sequence 7). 

Example 17: Identification of Sites in the Sugar Beet Protox-1 Gene that can be Mutated to 

Give Heri3icide Tolerance 

To create an efficient plasmid screening system for sugar beet protox-1, the sugar 
beet cDNA was engineered into the pMut-1 vector as described above for the maize cDNA. 
This chimeric Arab-sugar beet protox-1 plasmid is designated pMut-6. The pMut-6 DNA 
was mutated and screened for herbicide tolerance as described above. This analysis 
revealed multiple plasmids containing heriDiclde resistant protox coding sequences. 
Sequence analysis showed a single base change that results in an herbicide tolerant sugar 
beet protox-1 enzyme. This change (pSugC-2Cys) converts tyrosine (TAC) at amino acid 
449 to cysteine (TGC) and Is analogous to the AraC-2 mutations in Arabidopsis {see, Table 
IB; sub-sequence 7). 

Individual amino acid changes that produced highly hert^icide tolerant Arabidopsis 
protox-1 enzymes were engineered into the sugar beet protox-1 gene by site directed 
mutagenesis as described above. Bacterial testing demonstrated that changing the 
tyrosine (TAC) at amino acid 449 to leucine (pSugC-2Leu), isoleucine (pSugC-2lle), valine 
(pSugC-2Val), or methionine (pSugC-2Met) produced hert^icide tolerant sugar beet 
enzymes (see, Table IB; sub-sequence 7). 
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Example 18: Identification of Sites in the Cotton Protox-1 Gene that can be Mutated to Give 

Heiticide Tolerance 

In an effort to create an efficient plasmid screening system for cotton protox-1 , the 
cotton cDNA was engineered Into the pMut-1 vector as described above for the maize 
cDNA. This chimeric Arab-cotton protox-1 plasmid is designated pMut-7. The pMut-7 DNA 
was mutated and screened for herbicide tolerance as described above. This analysis 
revealed multiple plasmids containing herbicide resistant protox coding sequences. 
Sequence analysis showed 3 single base changes that individually result In an heri^icide 
tolerant cotton protox-1 enzyme. Three mutants, pCotC-2Cys, pCotC-2His and pCotC- 
2Arg, change tyrosine (TAG) at amino acid 428 (SEQ ID N0:16) to cysteine (TGC), histidine 
(GAG) and to arginine (GGG). respectively {see. Table 1 B; sub-sequence 7), Arginine is a 
novel substitution giving tolerance at this previously identified AraG-2 (sub-sequence 7) site. 
The third mutation (Got365Ser) converts proline (GOT) to serine (TGT) at amino acid 365. 
This change corresponds to the soybean mutant Soy369Ser (see, Table IB; sub-sequence 
5). 

Example 19: Demonstration of Resistant Mutations' Gross-Tolerance to Various Protox- 

Inhibiting Compounds 

Resistant mutant plasmids, originally identified based on resistance against a single 
protox inhibitory heri^icide, were tested against a spectmm of other protox inhibiting 
compounds. For this test, the SASX38 strain containing the wild-type plasmid is plated on a 
range of concentrations of each compound to determine the lethal concentration for each 
one. Resistant mutant plasmids in SASX38 are plated and scored for the ability to sun/ive 
on a concentration of each compound at least 10 fold higher than the concentration that is 
lethal to the SASX38 strain containing the wild-type plasmid. 

Results from bacterial cross-tolerance testing, illustrated in Tables 3A and 3B, show 
that each of the mutations identified confers tolerance to a variety of protox inhibiting 
compounds. 

Section G: Expression of HeriDicide-Resistant Protox Genes in Transgenic Plants 

Example 20: Engineering of Plants Tolerant to Protox-lnhibiting HertDicides by Homologous 

Recombination or Gene Conversion 
Because the described mutant coding sequences effectively confer hertslcide 
tolerance when expressed under the control of the native protox promoter, targeted 
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changes to the protox coding sequence in its native chromosomal location represent an 
alternative means for generating herbicide tolerant plants and plant cells. A fragment of 
protox DNA containing the desired mutations, but lacking its own expression signals (either 
promoter or 3' untranslated region) can be introduced by any of several art-recognized 
methods (for instance, Agrobacterium transformation, direct gene transfer to protoplasts, 
microprojectile bombardment), and herbicide-tolerant transformants selected. The 
introduced DNA fragment also contains a diagnostic restriction enayme site or other 
sequence polymorphism that is introduced by site-directed mutagenesis in vitro without 
changing the encoded amino acid sequence (i.e. a silent mutation). As has been previously 
reported for various selectable marker and herbicide tolerance genes (see, e.g., 
Paszkowski et aL, EMBO J. 7: 4021 -4026 (1 988); Lee et aL, Plant Cell 2: 41 5-425 (1 990); 
Risseeuw et aL, Plant J. 7: 109-1 19 (1995)). some transformants are found to result from 
homologous integration of the mutant DNA into the protox chromosomal locus, or from 
conversion of the native protox chromosomal sequence to the introduced mutant sequence. 
These transformants are recognized by the combination of their herbicide-tolerant 
phenotype, and the presence of the diagnostic restriction enzyme site in their protox 
chromosomal locus. 



Example 21: Construction of Plant Transfomiation Vectors 
Numerous transfonmation vectors are available for plant transfonmation, and the 
genes of this invention can be used in conjunction with any such vectors. The selection of 
vector for use will depend upon the preferred transfomiation technique and th^ target 
species for transfomiation. For certain target species, different antibiotic or heriDicide 
selection markers may be prefen-ed. Selection markers used routinely in transfonnation 
include the npf//gene, which confers resistance to kanamycin and related antibiotics 
(l\/lessing & Vierra. Gene 79:259-268 (1982); Bevan etaL, Nature 304:184-187 (1983)). the 
bar gene, which confers resistance to the heriDicide phosphinothricin (White etaL, NucI 
Adds Res 18: 1 062 (1 990),. Spencer ef a/. Theor AppI Genet 79: 625-631 (1 990)), the hph 
gene, which confers resistance to the antibiotic hygromycin (Blochinger & Diggelmann, Mol 
Cell Biol 4: 2929-2931), and the d/ifrgene. which confers resistance to methotrexate 
(Bourouis etaL, EMBO J. 2^;: 1099-1 104 (1983)). 

I. Construction of Vectors Suitable for Agrobacterium Transformation 

Many vectors are available for transformation using Agrobacterium tumefaclens. 
These typically carry at least one T-DNA border sequence and include vectors such as 
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pBIN19 (Bevan, Nucl. Acids Res. (1984)) and pXYZ. Below the constaiction of two typical 
vectors is described. 

Construction of pCIB200 and pCIB2001: The binary vectors pCIB200 and pCIB2001 
are used for the construction of recombinant vectors for use with Agrobacterium and was 
constructed in the following manner. pTJS75kan was created by Narl digestion of pTJS75 
(Schmidhauser & Helinski, J BacterioL 164: 446-455 (1985)) allowing excision of the 
tetracycline-resistance gene, followed by insertion of an >4cc/ fragment from pUC4K carrying 
an NPTII (Messing & Vierra, Gene 19: 259-268 (1982); Bevan etaL, Nature 304: 184-187 
(1 983); McBride et al. Plant Molecular Biology 14: 266-276 (1 990)). Xhol linkers were 
ligated to the EcoflV fragment of pCIB7, which contains the left and right T-DNA borders, a 
plant selectable nos/npf// chimeric gene and the pUC polylinker (Rothstein et a/., Gene 53: 
153-161 (1987)), and the X/}o/-digested fragment was cloned into Sa//-dlgested pTJS75kan 
to create pCIB200 (see also EP 0 332 104, example 1 9). pCIB200 contains the following 
unique polylinker restriction sites: EcoRI, SstI, Kpnl, Bglll, Xbal, and Sail. pCIB2001 is a 
derivative of pCIB200, which is created by the insertion into the polylinker of additional 
restriction sites. Unique restriction sites in the polylinker of pCIB2001 are EcoRI, SstI, Kpnl, 
Bglll, Xbal, Sail, Mlul, Bell, Avrll, Apal, Hpal, and StuL pClB2001 , in addition to containing 
these unique restriction sites also has plant and bacterial kanamycin selection, left and right 
T-DNA borders for Agrobacterium-medlated transformation, the RK2-derived trfA function 
for mobilization between E. coll and other hosts, and the OnTand OnV functions also from 
RK2. The pCIB2001 polylinker is suitable for the cloning of plant expression cassettes 
containing their own regulatory signals. 

Constmction of pCIBIO and Hygromycin Selection Derivatives Thereof: The binary 
vector pCIBIO contains a gene encoding kanamycin resistance for selection in plants, T- 
DNA right and left border sequences and incorporates sequences from the wide host-range 
plasmid pRK252 allowing it to replicate in both E. co// and Agrobacterium. Its construction is 
described by Rothstein ef a/., Gene 53: 153-161 (1987). Various derivatives of pCIBIO 
have been constmcted that incorporate the gene for hygromycin B phosphotransferase 
described by Gritz ef a/., Gene 25: 179-188 (1983)). These derivatives enable selection of 
transgenic plant cells on hygromycin only (pCIB743), or hygromycin and kanamycin 
(PCIB715, PCIB717). 

II. Construction of Vectors Suitable for non->lgrofaacfent;m Transformation. 

Transformation without the use of Agrobacterium tumefaciens circumvents the 
requirement for T-DNA sequences in the chosen transfonnation vector and consequently 



84 



wo 01/68826 



PCT/EPOl/02753 



vectors lacking these sequences can be utilized in addition to vectors such as the ones 
described above that contain T-DNA sequences. Transfonmation techniques that do not 
rely on Agrobacterium include transfonmation via particle bombardment, protoplast uptake 
(e.g. PEG and electroporation) and microinjection. The choice of vector depends largely on 
the prefen^ed selection for the species being transfomned. Below, the construction of some 
typical vectors is described. 

Construction of pCIB3064: pCIB3064 is a pUC-derived vector suitable for direct 
gene transfer techniques in combination with selection by the herbicide basta (or 
phosphinothricin). The plasmid pCIB246 comprises the CaMV 35S promoter in operational 
fusion to the E. co// GUS gene and the CaMV 35S transcriptional tenninator and is 
described in the PCT published application WO 93/07278. The 35S promoter of this vector 
contains two ATG sequences 5' of the start site. These sites were mutated using standard 
PGR techniques in such a way as to remove the ATG's and generate the restriction sites 
Sspl and PvulL The new restriction sites were 96 and 37-bp away from the unique Sail site 
and 101 and 42-bp away from the actual start site. The resultant derivative of pCIB246 was 
designated pCIB3025. The GUS gene was then excised from pCIB3025 by digestion with 
Sail and Sac/, the termini rendered blunt and religated to generate plasmid pC!B3060. The 
plasmid pJIT82 was obtained from the John Innes Centre, Nonwich and the a 400-bp Smal 
fragment containing the tor gene from Streptomyces viridochromogenesweis excised and 
inserted into the Hpa/ site of pCIB3060 (Thompson etal. EMBO J 6: 2519-2523 (1987)). 
This generated pCIB3064, which comprises the bar gene under the control of the CaMV 
35S promoter and temiinator for herbicide selection, a gene for ampicillin resistance (for 
selection in £ coli) and a polylinker with the unique sites SphI, PstI, Hindlll, and BamHL 
This vector is suitable for the cloning of plant expression cassettes containing their own 
regulatory signals. 

Construction of pSOG19 and pSOG35: pSOG35 is a transfonmation vector that 
utilizes the E. co// gene dihydrofolate reductase (DHFR) as a selectable maricer conferring 
resistance to methotrexate. PCR was used to amplify the 35S promoter (-800-bp), intron 6 
from the maize Adhi gene (-550-bp) and 18-bp of the GUS untranslated leader sequence 
from pSOGIO. A 250-bp fragment encoding the £ co// dihydrofolate reductase type II gene 
was also amplified by PCR and these two PCR fragments were assembled with a Sacl-Pstl 
fragment from pBI221 (Clontech), which comprised the pUC19 vector backbone and the 
nopaline synthase terminator. Assembly of these fragments generated pS0G19, which 
contains the 35S promoter in fusion with the intron 6 sequence, the GUS leader, the DHFR 
gene and the nopaline synthase temninator. Replacement of the GUS leader in pS0G19 
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With the leader sequence from Maize Chlorotic Mottle Virus (MCMV) generated the vector 
pSOG35. pSOG19 and pSOG35 carry the pUC gene for ampicillin resistance and have 
Hindlll, SphI, Psf/and EcoRI sites availabie for the cloning of foreign sequences. 

Example 22: Construction of Plant Expression Cassettes 
Gene sequences intended for expression in transgenic plants are firstly assembled 
in expression cassettes behind a suitable promoter and upstream of a suitable transcription 
terminator. These expression cassettes can then be easily transferred to the plant 
transfomnation vectors described above in Example 21. 

I. Promoter Selection 

The selection of a promoter used in expression cassettes will determine the spatial 
and temporal expression pattern of the transgene in the transgenic plant. Selected 
promoters will express transgenes in specific cell types (such as leaf epidermal cells, 
mesophyll cells, root cortex cells) or in specific tissues or organs (roots, leaves or flowers, 
for example) and this selection will reflect the desired location of expression of the 
transgene. Alternatively, the selected promoter may drive expression of the gene under a 
light-induced or other temporally regulated promoter. A further alternative is that the 
selected promoter be chemically regulated. This would provide the possibility of inducing 
expression of the transgene only when desired and caused by treatment with a chemical 
inducer. 

II. Transcriptional Terminators 

A variety of transcriptional temiinators are available for use in expression cassettes. 
These are responsible for the termination of transcription beyond the transgene and its 
correct polyadenylation. Appropriate transcriptional temninators are those that are Icnown to 
function in plants and include the CaMV 35S terminator, the fm/temninator, the nopaline 
synthase terminator, the pea tbcS E9 tenninator, as well as terminators naturally associated 
with the plant protox gene (i.e. "protox tenminators"). These can be used in both 
monocotyledons and dicotyledons. 

III. Sequences for the Enhancement or Regulation of Expression 

Numerous sequences have been found to enhance gene expression from within the 
transcriptional unit and these sequences can be used in conjunction with the genes of this 
invention to increase their expression in transgenic plants. 
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Various intron sequences have been shown to enhance expression, particulariy in 
monocotyledonous cells. For example, the Introns of the maize Adhi gene have been 
found to significantly enhance the expression of the wild-type gene under its cognate 
promoter when introduced into maize cells. Intron 1 was found to be particularly effective 
and enhanced expression in fusion constructs with the chloramphenicol acetyltransferase 
gene (Callis ef a/., Genes Develop. 1:11 83-1 200 (1 987)). In the same experimental 
system, the intron from the maize bronzel gene had a similar effect in enhancing 
expression (Callis ef a/., supra). Intron sequences have been routinely incorporated into 
plant transfomiation vectors, typically within the non-translated leader. 

A number of non-translated leader sequences derived from viruses are also known 
to enhance expression, and these are particulariy effective in dicotyledonous ceils. 
Specifically, leader sequences from Tobacco Mosaic Vims (TMV, the **W-sequence"), Maize 
Chlorotic Mottle Virus (MCMV), and Alfalfa Mosaic Virus (AMV) have been shown to be 
effective in enhancing expression (e.g. Gallie ef a/. Nucl. Acids Res. 15: 8693-8711 (1987); 
Skuzeski ef a/. Plant Molec. Biol. 15: 65-79 (1990)) 

IV. Targeting of the Gene Product Within the Cell 

Various mechanisms for targeting gene products are known to exist in plants and 
the sequences controlling the functioning of these mechanisms have been characterized in 
some detail. For example, the targeting of gene products to the chloroplast is controlled by 
a signal sequence that is found at the amino terminal end of various proteins and that is 
cleaved during chloroplast import yielding the mature protein {e.g. Comai etal. J. Biol. 
Chem. 263: 15104-15109 (1988)). These signal sequences can be fused to heterologous 
gene products to effect the import of heterologous products into the chloroplast (van den 
Broeck etal. Nature 313: 358-363 (1985)). DNA encoding for appropriate signal sequences 
can be isolated from the 5' end of the cDNAs encoding the RUBISCO protein, the CAB 
protein, the EPSP synthase enzyme, the GS2 protein and many other proteins that are 
known to be chloroplast localized. 

Other gene products are localized to other organelles such as the mitochondrion 
and the peroxisome {e.g. Unger etal. Plant Molec. Biol. 13: 411-418 (1989)). The cDNAs 
encoding these products can also be manipulated to effect the targeting of heterologous 
gene products to these organelles. Examples of such sequences are the nuclear-encoded 
ATPases and specific aspartate amino transferase isoforms for mitochondria. Targeting to 
cellular protein bodies has been described by Rogers etal., Proc. Natl. Acad. ScL USA 82. 
6512-6516(1985)). 
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In addition, sequences have been characterized that cause the targeting of gene 
products to other cell compartments. Amino terminal sequences are responsible for 
targeting to the ER, the apoplast, and extracellular secretion from aleurone cells (Koehler & 
Ho, Plant Cell 2: 769-783 (1990)). Additionally, amino tenmlnal sequences in conjunction 
with carboxy tenminal sequences are responsible for vacuolar targeting of gene products 
(Shinshi etal., Plant Molec. Biol. 14: 357-368 (1990)). 

By the fusion of the appropriate targeting sequences described above to transgene 
sequences of interest it is possible to direct the transgene product to any organelle or cell 
compartment. For chloroplast targeting, for example, the chloroplast signal sequence from 
the RUBISCO gene, the CAB gene, the EPSP synthase gene, or the GS2 gene is fused in 
frame to the amino temninal ATG of the transgene. The signal sequence selected should 
include the known cleavage site and the fusion constructed should take into account any 
amino acids after the cleavage site that are required for cleavage. In some cases this 
requirement may be fulfilled by the addition of a small number of amino acids between the 
cleavage site and the transgene ATG or alternatively replacement of some amino acids 
within the transgene sequence. Fusions constructed for chloroplast import can be tested 
for efficacy of chloroplast uptake by in vitro translation of in vitro transcribed constructions 
followed by In Wfro chloroplast uptake using techniques described by (Bartlett etal. In: 
Edelmann etal. (Eds.) Methods in Chloroplast Molecular Biology, Elsevier, pp. 1081-1091 
(1982); Wasmann etai MoL Gen. Genet 205: 446-453 (1986)). These construction 
techniques are well known in the art and are equally applicable to mitochondria and 
peroxisomes. The choice of targeting that may be required for expression of the 
transgenes will depend on the cellular localization of the precursor required as the starting 
point for a given pathway. This will usually be cytosolic or chloroplastic, although it may is 
some cases be mitochondrial or peroxisomal. The products of transgene expression will not 
normally require targeting to the ER, the apoplast or the vacuole. 

The above described mechanisms for cellular targeting can be utilized not only in 
conjunction with their cognate promoters, but also in conjunction with heterologous 
promoters so as to effect a specific cell targeting goal under the transcriptional regulation of 
a promoter that has an expression pattern different to that of the promoter from which the 
targeting signal derives. 

Example 23: Transfonnation of Dicotyledons 
Transformation techniques for dicotyledons are well known in the art and include 
Agrobacteriunfhbased techniques and techniques that do not require Agrobacterium. Non- 
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Agrobacterium techniques involve tlie uptal<e of exogenous genetic material directly by 
protoplasts or cells. This can be accomplished by PEG or electroporation mediated uptake, 
particle bombardment-mediated delivery, or microinjection. Examples of these techniques 
are described by Paszkowski etaL, EMBO J 3; 2717-2722 (1984), Potrykus ef a/., MoL 
Gen. Genet 795; 169-177 (1985), Reich etaL, Biotechnology 4: 1001-1004 (1986), and 
Klein et al, Nature 327: 70-73 (1987). In each case the transformed cells are regenerated 
to whole plants using standard techniques known in the art. 

Agrobacterium-mediatBd transformation is a pref enred technique for transfonmatlon 
of dicotyledons because of its high efficiency of transfomnation and Its broad utility with 
many different species. The many crop species that are routinely transfonmable by 
Agrobacterium include tobacco, tomato, sunflower, cotton, oilseed rape, potato, soybean, 
alfalfa and poplar (EP 0 317 51 1 (cotton), EP 0 249 432 (tomato, to Calgene), WO 
87/07299 {Brassica, to Calgene), US 4,795.855 (poplar)). 

Transformation of the target plant species by recombinant Agrobacterium usually 
involves co-cultivation of the Agrobacterium with explants from the plant and follows 
protocols well known in the art. Transfomned tissue is regenerated on selectable medium 
canying the antibiotic or heribicide resistance maricer present between the binary plasmid T- 
DNA borders. 

Example 24: Transformation of Monocotyledons 
Transfomnation of most monocotyledon species has now also become routine. 
Preferred techniques include direct gene transfer into protoplasts using PEG or 
electroporation techniques, and particle bombardment into callus tissue. Transformations 
can be undertaken with a single DNA species or multiple DNA species {i.e. co- 
transformation) and both these techniques are suitable for use with this invention. Co- 
transformation may have the advantage of avoiding complex vector constmctlon and of 
generating transgenic plants with unlinked loci for the gene of interest and the selectable 
marker, enabling the removal of the selectable marker In subsequent generations, should 
this be regarded desirable. However, a disadvantage of the use of co-transfonnation is the 
less than 100% frequency with which separate DNA species are integrated into the genome 
(Schocher et al. Biotechnology 4: 1 093-1 096 (1 986)). 

Patent Applications EP 0 292 435 (to Ciba-Geigy), EP 0 392 225 (to Ciba-Geigy) 
and WO 93/07278 (to Ciba-Geigy) describe techniques for the preparation of callus and 
protoplasts from an 6lite inbred line of maize, transformation of protoplasts using PEG or 
electroporation, and the regeneration of maize plants from transformed protoplasts. 
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Gordon-Kamm ef a/., Plant Cell 2: 603-618 (1990)) and Fromm et aL, Biotechnology 8: 833- 
839 (1990)) have published techniques for transfonnation of A188-derrved maize line using 
particle bombardment. Furthermore, application WO 93/07278 (to Ciba-Geigy) and Koziel 
et aL, Biotechnology 1 1: 1 94-200 (1 993)) describe techniques for the transformation of dlite 
inbred lines of maize by particle bombardment. This technique utilizes immature maize 
embryos of 1 .5-2.5 mm length excised from a maize ear 14-15 days after pollination and a 
PDS-IOOOHe Biolistics device for bombardment. 

Transformation of rice can also be undertaken by direct gene transfer techniques 
utilizing protoplasts or particle bombardment. Protoplast-mediated transformation has been 
described for Japonica-tfpes and /ncf/ca-types (Zhang ef a/., Plant Cell Rep 7: 379-384 
(1988); Shimamoto etal. Nature 338: 274-277 (1989); Datta etal. Biotechnology 8:736-740 
(1990)). Both types are also routinely transfomnable using particle bombardment (Christou 
ef al. Biotechnology 9: 957-962 (1 991)). 

Patent Application EP 0 332 581 (to Ciba-Geigy) describes techniques for the 
generation, transfomnation and regeneration of Pooideae protoplasts. These techniques 
allow the transfomiation of Dactylis and wheat. Furthermore, wheat transfomnation was 
been described by Vasil etal.. Biotechnology 70:667-674 (1992)) using particle 
bombardment into cells of type C long-temi regenerable callus, and also by Vasil etal., 
Biotechnology 1 1: 1 553-1 558 (1 993)) and Weeks ef a/., Plant Physiol. 10Z 1 077-1 084 
(1993) using particle bombardment of immature embryos and immature embryo-derived 
callus. A preferred technique for wheat transformation, however, involves the 
transfonnation of wheat by particle bombardment of Immature embryos and includes either 
a high sucrose or a high maltose step prior to gene delivery. Prior to bombardment, any 
number of embryos (0.75-1 mm in length) are plated onto MS medium with 3% sucrose 
(Murashige & Skoog, Physiologia Plantamm 15: 473-497 (1962)) and 3 mg/l 2,4-D for 
induction of somatic embryos, which is allowed to proceed in the dari<. On the chosen day 
of bombardment, embryos are removed from the induction medium and placed onto the 
osmoticum {i.e. induction medium with sucrose or maltose added at the desired 
concentration, typically 15%). The embryos are allowed to plasmolyze for 2-3 h and are 
then bombarded. Twenty embryos per target plate is typical, although not critical. An 
appropriate gene-carrying plasmid (such as pCIB3064 or pSG35) is precipitated onto 
micrometer size gold particles using standard procedures. Each plate of embryos is shot 
with the DuPont Biolistics' helium device using a burst pressure of -1000 psi using a 
standard 80 mesh screen. After bombardment, the embryos are placed back into the dark 
to recover for about 24 h (still on osmoticum). After 24 hrs, the embryos are removed from . 
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the osmoticum and placed back onto induction medium wliere they stay for about a month 
before regeneration. Approximately one month later the embryo explants with developing 
embryogenic callus are transferred to regeneration medium (MS + 1 mg/liter NAA, 5 mg/liter 
GA), further containing the appropriate selection agent (10 mg/l basta in the case of 
pCIB3064 and 2 mg/I methotrexate in the case of pSOG35). After approximately one 
month, developed shoots are transfen-ed to larger sterile containers known as "GA7s" that 
contained half-strength MS, 2% sucrose, and the same concentration of selection agent. 
Patent application WO 94/13822 describes methods for wheat transfomnation and is hereby 
incorporated by reference. 

Example 25: Isolation of the Arabidopsis thaliana Protox-1 Promoter Sequence 
A Lambda Zap II genomic DNA library prepared from Arabidopsis thaliana 
(Columbia, whole plant) was purchased from Stratagene. Approximately 125,000 phage 
were plated at a density of 25,000 pfu per 15 cm Petri dish and duplicate lifts were made 
onto Colony/Plaque Screen membranes (NEN Dupont). The plaque lifts were probed with 
the Arabidopsis protox-1 cDNA (SEQ ID N0:1 labeled with 32P-dCTP by the random 
priming method (Life Technologies). Hybridization and wash conditions were at 65*'C as 
described in Church and Gilbert, Proc. Natl. Acad, ScL USA 81: 1991-1995 (1984). 
Positively hybridizing plaques were purified and in vivo excised into pBluescript plasmids. 
Sequence from the genomic DNA inserts was detemnined by the chain termination method 
using dideoxy temninators labeled with fluorescent dyes (Applied Biosystems, Inc.). One 
clone, AraPTI Pro, was determined to contain 580-bp of Arabidopsis sequence upstream 
from the initiating methionine (ATG) of the protox-1 protein coding sequence. This clone 
also contains coding sequence and introns that extend to-bp 1241 of the protox-1 cDNA 
sequence. The 580-bp 5' noncoding fragment is the putative Arabidopsis protox-1 
promoter, and the sequence is set forth in SEQ ID NO:13. 

AraPTI Pro was deposited December 15, 1995, as pWDC-11 (NRRL #8-21515) 

Example 26: Construction of Plant Transformation Vectors Expressing Altered Protox-1 
Genes Behind the Native Arabidopsis Protox-1 Promoter 
A full-length cDNA of the appropriate altered Arabidopsis protox-1 cDNA was 
isolated as an EcoRhXhol partial digest fragment and cloned into the plant expression 
vector pCGNI 761 ENX (see Example 9 of Intemational application no. PCT/1 895/00452 
filed June 8, 1995, published Dec. 21 , 1995 as WO 95/34659). This plasmid was digested 
with A/co/and BamHIXo produce a fragment comprised of the complete protox-1 cDNA plus 
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a transcription tenninator from the 3* untranslated sequence of the tml gene of 
Agrobacterium tumefaciens. The AraPTIPro plasmid described above was digested with 
Ncol and BamHI to produce a fragment comprised of pBluescript and the 580-bp putative 
Arabldopsis protox-1 promoter. Ligation of these two fragments produced a fusion of the 
altered protox cDNA to the native protox promoter. The expression cassette containing the 
protox-1 promoter/protox-1 cDNA/fm/tenminator fusion was excised by digestion with Kpnl 
and cloned into the binary vector pCIB200. The binary plasmid was transformed by 
electroporation into Agrobacterium and then into Arabidopsis using the vacuum Infiltration 
method (Bechtold etal., C.R. Acad. ScL Paris 3^6: 1194-1199 (1993). Transformants 
expressing altered protox genes were selected on kanamycin or on various concentrations 
of protox inhibiting herbicide. 

Example 27: Production of HertDicide Tolerant Plants by Expression of a Native Protox-1 

Promoter/Altered Protox-1 Fusion 
Using the procedure described above, an Arabidopsis protox-1 cDNA containing a 
TAG to ATG (Tyrosine to Methionine) change at nucleotides 1306-1308 in the protox-1 
sequence (SEQ ID NO:1) was fused to the native protox-1 promoter fragment and 
transfonmed into Arabidopsis thaliana. This altered protox-1 enzyme (AraC-2Met) has been 
shown to be >1 0-fold more tolerant to various protox-inhibiting herbicides than the naturally 
occumng enzyme when tested in the previously described bacterial expression system. 
Seed from the vacuum infiltrated plants was collected and plated on a range (lO.OnM- 
1 .OuM) of a protox Inhibitory aryluracil hert^icide of fomiuia XVII. Multiple experiments with 
wild type Arabidopsis have shown that a lO.OnM concentration of this compound Is 
sufficient to prevent nonnal seedling gemiinatlon. Transgenic seeds expressing the AraC- 
2Met altered enzyme fused to the native protox-1 promoter produced nomnal Arabidopsis 
seedlings at herbicide concentrations up to 500nM, indicating at least 50-fold higher 
herbicide tolerance when compared to wild-type Arabidopsis. This promoter/altered protox 
enzyme fusion therefore functions as an effective selectable mart<er for plant 
transfomnation. Several of the plants that germinated on lOO.OnM of protox-inhibiting 
herbicide were transplanted to soil, grown 2-3 weeks, and tested in a spray assay with 
various concentrations of the protox-inhibiting herbicide. When compared to empty vector 
control transformants, the AraPTI Pro/AraC-2Met transgenics were >1 0-fold more tolerant to 
the hert^icide spray. 
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EXAMPLE 28: Demonstration of resistant mutations' cross-tolerance to various protox- 
inhibiting compounds In an Arabidopsis germination assay. 
Using the procedure described above, an Arabidopsis protox-1 cDNA containing 
both a TAG to ATC (tyrosine to isoleucine) change at nucleotides 1306-1308 and a TCA to 
TTA (serine to leucine) change at nucleotides 945-947 in the protox-1 sequence (SEQ ID 
N0:1) was fused to the native protox-1 promoter fragment and transformed Into Arabidopsis 
tlialiana. This altered protox-1 enzyme (AraC-2lle + AraC305Leu) has been shown to be 
>1 0-fold more tolerant to a protox inhibitory aryluracil herbicide of formula XVII than the 
naturally occurring enzyme when tested in a bacterial system (see Examples 9-13). 
Homozygous Arabidopsis lines containing this fusion were generated from transformants 
that showed high tolerance to a protox inhibiting herbicide in a seedling germination assay 
as described above. The seed from one line was tested for cross-tolerance to various 
protox-inhibitory compounds by repeating the germination assay on concentrations of the 
compounds that had been shown to inhibit germination of wild-type Arabidopsis. The 
results from these experiments are shown in Table 4. 

Example 29: Isolation of a Maize Protox-1 Promoter Sequence 
A Zea Mays (Missouri 17 inbred, etiolated seedlings) genomic DNA library in the 
Lambda FIX II vector was purchased from Stratagene. Approximately 250,000 pf u of the 
library was plated at a density of 50,000 phage per 15 cm plate and duplicate lifts were 
made onto Colony/Plaque screen membranes (NEN Dupont). The plaque lifts were probed 
with the maize protox-1 cDNA (SEQ ID NO:5) labeled with 32P-dCTP by the random 
priming method (Life Technologies). Hybridization and wash conditions were at 65°C as 
described in Church and Gilbert, Proc. Natl. Acad. Sci. USA 81: 1991-1995 (1984). 
Lambda phage DNA was isolated from three positively hybridizing phage using the Wizard 
Lambda Preps DNA Purification System (Promega). Analysis by restriction digest, 
hybridization patterns, and DNA sequence analysis Identified a lambda clone containing 
approximately 3.5 kb of maize genomic DNA located 5' to the maize protox-1 coding 
sequence previously isolated as a cDNA clone. This fragment includes the maize protox-1 
promoter. The sequence of this fragment is set forth in SEQ ID NO:14. From nucleotide 1 
to 3532. this sequence is comprised of 5* noncoding sequence. From nucleotide 3533 to 
3848, this sequence encodes the 5' end of the maize protox-1 protein. 

A plasmid containing the sequence of SEQ ID N0:14 fused to the remainder of the 
maize protox-1 coding sequence was deposited March 19, 1996 as pWDC-14 (NRRL #B- 
21546). 
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Example 30: Construction of Plant Transformation Vectors Expressing Altered Protox-1 
Genes Behind the Native IVIaize Protox-1 Promoter 

The 3848-bp maize genomic fragment (SEQ ID NO: 14) was excised from the 
isolated lambda phage clone as a Sa//-Kpnl partial digest product and ligated to a Kpnl-Notl 
fragment derived from an altered maize protox-1 cDNA that contained an alanine to leucine 
change at amino acid 164 (SEQ ID N0:6). This created a fusion of the native maize protox- 
1 promoter to a full length cDNA that had been shown to confer herbicide tolerance in a 
bacterial system (Examples 9-14). This fusion was cloned into a pUCIS derived vector 
containing the CaMV 35S terminator sequence to create a protox promoter/altered protox 
cDNA/tenninator cassette. The plasmid containing this cassette was designated pWCo-1 . 

A second construct for maize transformation was created by engineering the first 
intron found in the coding sequence from the maize genomic clone back into the maize 
cDNA. The insertion was made using standard overlapping PGR fusion techniques. The 
intron (SEQ ID NO:25) was 93-bp long and was Inserted between nucleotides 203 and 204 
of SEQ ID NO:6, exactly as it appeared in natural context in the lambda clone described in 
Example 29. This intron-containing version of the expression cassette was designated 
pWCo-2. 

Example 31: Demonstration of Maize Protox-1 Promoter Activity in Transgenic Maize 

Plants 

Maize plants transformed with maize protox promoter/altered protox fusions were 
identified using PGR analysis with primers specific for the transgene. Total RNA was 
prepared from the PGR positive plants and reverse-transcribed using Superscript M-MLV 
(Life Technologies) under recommended conditions. Two microliters of the reverse 
transcription reaction was used in a PGR reaction designed to be specific for the altered 
protox sequence. While untransformed controls give no product in this reaction, 
approximately 85% of plants transfomrted with pWGo-1 gave a positive result, indicating the 
presence of mRNA derived from the transgene. This demonstrates some level of activity for 
the maize protox promoter. The RNA's from the transgenic maize plants were also 
subjected to standard northem blot analysis using the radiolabeled maize protox cDNA 
fragment from SEQ ID N0:6 as a probe. Protox-1 mRNA levels significantly above those of 
untransformed controls were detected in some of the transgenic maize plants. This 
elevated mRNA level is presumed to be due to expression of altered protox-1 mRNA from 
the cloned maize protox promoter. 
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Example 32: Isolation of a Sugar Beet Protox-1 Promoter Sequence 
A genomic sugar beet library was prepared by Stratagene in the Lambda Fix II 
vector. Approximately 300,000 pfu of the library was plated and probed with the sugar beet 
protox-1 cDNA sequence (SEQ ID N0:17) as described for maize in Example 29. Analysis 
by restriction digest, hybridization patterns and DNA sequence analysis identified a lambda 
clone containing approximately 7 kb of sugar beet genomic DNA located 5' to the sugar 
beet coding sequence previously isolated as a cDNA clone. A Psf/-Sa// fragment of 2606- 
bp was subcloned from the lambda clone into a pBluescript vector. This fragment contains 
2068-bp of 5' noncoding sequence and includes the sugar beet protox-1 promoter 
sequence. It also includes the first 453-bp of the protox-1 coding sequence and the 85-bp 
first intron contained in the coding sequence. The sequence of this fragment is set forth in 
SEQ ID NO:26. 

A plasmid containing the sequence of SEQ ID NO:26 was deposited December 6. 
1996 as pWDC-20 (NRRL #8-21650). 

Example 33: Construction of Plant Transfomfiation Vectors Expressing Altered Sugar Beet 
Protox-1 Genes Behind the Native Sugar Beet Protox-1 Promoter 
The sugar beet genomic fragment (SEQ ID NO:26) was excised from the genomic 
subclone described in Example 32 as a Sac/-SsrG/ fragment that Includes 2068-bp of 5' 
noncoding sequence and the first 300-bp of the sugar beet protox-1 coding sequence. This 
fragment was ligated to a 8srG/-A/of/ fragment derived from an altered sugar beet protox-1 
cDNA that contained a tyrosine to methionine change at amino acid 449 (SEQ ID NO:18), 
This created a fusion of the native sugar beet protox-1 promoter to a full length cDNA that 
had been shown to confer heriblcide tolerance in a bacterial system (Examples 9-14). This 
fusion was cloned Into a pUC18 derived vector containing the CaMV 35S temiinator 
sequence to create a protox promoter/altered protox cDNA/tenninator cassette. The 
plasmid containing this cassette was designated pWCo-3. 

Example 34: Production of HertDicide Tolerant Plants by Expression of a Native Sugar Beet 
Protox-1 Promoter/Altered Sugar Beet Protox-1 Fusion 
The expression cassette from pWCo-3 is transfonmed into sugar beet using any of 
the transformation methods applicable to dicot plants, including Agrobacterium, protoplast, 
and biolistic transfomnation techniques. Transgenic sugar beets expressing the altered 
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protox-1 enzyme are identified by RNA-PCR and tested for tolerance to protox-lnhibiting 
herbicides at concentrations that are lethal to untransformed sugar beets. 



Section D: Expression of Protox Genes in Plant Plastids 

Example 35: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and Native cIpPS' Untranslated Sequence Fused to a GUS Reporter Gene and 
Plastid rps16 Gene 3' Untranslated Sequence in a Plastid Transfonnation Vector 

I. Amplification of the Tobacco Plastid cIpP Gene Promoter and Complete 5' Untranslated 
RNA (5' UTR). 

Total DNA from N. tabacum c.v. "Xanthi NC" was used as the template for PCR with 
a left-to-right "top strand" primer comprising an introduced EcoRI restriction site at position - 
197 relative to the ATG start codon of the constitutively expressed plastid c/pP gene (primer 
Pclp_P1a: 5'-GCGGMIICATACTTATTTATCATTAGAAAG-3' (SEQ ID NO:27); EcoRI 
restriction site underilned) and a right-to-left "bottom strand" primer homologous to the 
region from -21 to -1 relative to the ATG start codon of the cIpP promoter that incorporates 
an introduced Ncol restriction site at the start of translation (primer Pclp_P2b: 5'- 
GCGCCAIGGTAAATGAAAGAAAGAACTAAA-3' (SEQ ID NO:28); Wco/ restriction site 
underiined). This PCR reaction was undertaken with Pfu thermostable DNA polymerase 
(Stratagene, La Jolla CA) in a Pertcin Elmer Themial Cycler 480 according to the 
manufacturer's recommendations (Pert<in Elmer/Roche, Branchburg. NJ) as follows: 7 min 
QS'^C. followed by 4 cycles of 1 min 95°C / 2 min 43'*C / 1 min 72'*C, then 25 cycles of 1 min 
QS^'C 1 2 min 55°C / 1 min 72°C. The 213-bp amplification product comprising the promoter 
and 5' untranslated region of the c/pP gene containing an Eco/?/ site at its left end and an 
Ncol site at its right end and corresponding to nucleotides 74700 to 74505 of the N. 
fa/?aci;m plastid DNA sequence (Shinozaki ef a/., EMBOJ. 5: 2043-2049 (1986)) was gel 
purified using standard procedures and digested with EcoRI and Ncol (all restriction 
enzymes were purchased from New England Biolabs, Beverly, MA). 

II. Amplification of the Tobacco Plastid rps16 Gene 3' Untranslated RNA Sequence 
(3'UTR). 

Total DNA from N tabacum c.v. "Xanthi NC" was used as the template for PCR as 
described above with a left-to-right "top strand" primer comprising an introduced Xbal 
restriction site immediately following the TAA stop codon of the plastid rps16 gene encoding 
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libosomal protein S16 (primer rps16P_1a (5'- 

GCGICIAGATCAACCGAAATTCAATTAA6G-3' (SEQ ID NO:30); Xbal restriction site 
underiined) and a right-to-left "bottom strand" primer homologous to the region from +134 to 
+151 relative to the TAA stop codon of rpsi 6 that incorporates an introduced Hindlll 
restriction site at the 3' end of the rps163' UTR (primer rps16P_1b (5'- 
CGCAAGCnCAATGGAAGCAATGATAA-3' (SEQ ID N0:31); Hindlll restriction site 
underiined). The 169-bp amplification product comprising the 3' untranslated region of the 
rpsW gene containing an Xbal site at its left end and a Hindlll site at its right end and 
containing the region corresponding to nucleotides 4943 to 5093 of the N. tabacum plastid 
DNA sequence (Shinozaki etaL, 1986) was gel purified and digested with Xbal and Hindlll. 

III. Ligation of a GUS Reporter Gene Fragment to the cIpP Gene Promoter and 5' and 3* 
UTR's. 

An 1864-bp p-glucuronidase (GUS) reporter gene fragment derived from plasmid 
pRAJ275 (Clontech) containing an Ncol restriction site at the ATG start codon and an Xbal 
site following the native 3' UTR was produced by digestion with Ncol and Xbal. This 
fragment was ligated in a four-way reaction to the 201 -bp EcoRUNcol c/pP promoter 
fragment, the 157-bp XballHindlll /ps;6 3'UTR fragment, and a 3148-bp EcoRI/Hindlll 
fragment from cloning vector pGEM3Zf(-) (Promega, Madison Wl) to construct plasmid 
pPH138. Plastid transformation vector pPH140 was constructed by digesting plasmid 
pPRVII la (Zoubenko etal. 1994) with EcoRI and Hindlll and ligating the resulting 7287-bp 
fragment to a 2222-bp Eco/?//H/nc//// fragment of pPH138. 

Example 36: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter Plus Tobacco Plastid psbA Gene Minimal 5' Untranslated Sequence Fused to a 
GUS Reporter Gene and Plastid rps16 Gene 3' Untranslated Sequence in a Plastid 

Transfomriation Vector 
Amplification of the tobacco plastid cIpP g^ne promoter and truncated 5' 
untranslated RNA (5' UTR): Total DNA from N. tabacum c.v. "Xanthi NC" was used as the 
template for PGR as described above with the left-to-right "top strand" primer Pclp_P1a 
(SEQ ID NO:27) and a right-to-left "bottom strand" primer homologous to the region from - 
34 to -1 1 relative to the ATG start codon of the cIpP promoter that incorporates an 
introduced Xbal restriction site in the cIpP 5' UTR at position -1 1 (primer Pclp_P1b: 5'- 
GCGICIAGAAAGAACTAAATACTATATTTCAC-3' (SEQ ID NO:29); Xda/ restriction site 
underiined). The 202-bp amplification product comprising the promoter and taincated 5' 
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UTR of the cIpP gene containing an EcoRI site at its left end and an Xbal site at its right 
end was gel purified and digested with XbaL The Xbal site was subsequently filled in with 
Klenow DNA polymerase (New England Biolabs) and the fragment digested with EcoRL 
This was ligated in a five-way reaction to a double stranded DNA fragment conresponding to 
the final 38 nucleotides and ATG start codon of the tobacco plastid psbA gene 5* UTR (with 
an Ncol restriction site overtiang introduced into the ATG start codon) that was created by 
annealing the synthetic oligonucleotides minpsb_U (top strand: 5'- 
GGQAQTCCCTGATGATTAAATAAACCAAGATTTTAC-3' (SEQ ID NO:32)) and minpsb^L 
(bottom strand: 5'-CMGGTAAAATCTTGGTTTATTTAATCATCAGGGACTCCC-3* (SEQ ID 
NO:33); Wco/ restriction site 5" overtiang underlined), the Ncot/Xbal GUS reporter gene 
fragment described above, the Xbal/Hindlll rpsW 3VTR fragment described above, and the 
EcoRI/ Hlndlll pGEM3Zf(-) fragment described above to construct plasmid pPH1 39. Plastid 
transfonnation vector pPH1 44 was constructed by digesting plasmid pPRV1 1 1 a (Zoubenko, 
ef a/.. Nucleic Acids Res 22: 3819-3824 (1994)) with EcoRI and Hlndlll and ligating the 
resulting 7287-bp fragment to a 2251-bp Ecofl//H/ncf/// fragment of pPH139- 

Example 37: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 

Promoter and Complete 5' Untranslated Sequence Fused to the Arabidopsis thallana 
Protox-1 Coding Sequence and Plastid rps16 Gene 3' Untranslated Sequence in a Vector 

for Tobacco Plastid Transfomnatlon 
Miniprep DNA from plasmid AraC-2Met carrying an Arabidopsis thallana NotI insert 
that includes cDNA sequences from the Protoporphyrinogen IX Oxidase ("protox") gene 
encoding a portion of the amino terminal plastid transit peptide, the full-length cDNA and a 
portion of the 3' untranslated region was used as the template for PCR as described above 
using a left-to-right "top strand" primer (with homology to nucleotides +172 to +194 relative 
to the ATG start codon of the full length precursor protein) comprising an introduced Ncol 
restriction site and new ATG start codon at the deduced start of the mature protox protein 
coding sequence (primer APRTXPIa: 5'- 

GGGACCAIGGATTGTGTGATTGTCGGCGGAGG-3' (SEQ ID NO:34); A/co/ restriction site 
underilned) and a right-to-left "bottom strand" primer homologous to nucleotides +917 to 
+940 relative to the native ATG start codon of the protox precursor protein (primer 
APRTXPIb: 5'- CTCCGCTCTCCAGCTTAGTGATAC-3' (SEQ ID NO:35)). The 778-bp 
product was digested with Ncol and Sful and the resulting 682-bp fragment ligated to an 
844-bp Sful/NotI DNA fragment of AraC-2Met comprising the 3' portion of the protox coding 
sequence and a 2978-bp Ncol/ NotI UagmenX of the cloning vector pGEM5Zf(+) (Promega, 
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Madison Wl) to construct plasmid pPH141 . Plastid transformation vector pPH1 43 
containing the cIpP pronnoter driving the Fonnula XVII-resistant AraC-2Met protox gene with 
the rps163' UTR was constructed by digesting pPH141 with Ncoland Sspl and isolating 
the 1491 -bp fragment containing the complete protox coding sequence, digesting the 
rps16P.1a and rps16P_1b PGR product described above with Hindlll, and ligating these to 
a 7436-bp Ncol/Hindlll UagmerW of pPH140, 

Example 38: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter Plus Tobacco Plastid psbA Gene Minimal 5' Untranslated Sequence Fused to the 

Arabidopsis thaliana Protox-1 Coding Sequence and Plastid rps16 Gene 3' Untranslated 
Sequence in a Vector for Tobacco Plastid Transfomnation 
Plastid transformation vector pPH145 containing the c/pP promoter/psM 5' UTR 
fusion driving the Formula XVII-resistant AraC-2Met protox gene with the rps163' UTR was 
constructed by digesting pPH141 with Ncol and Sspl and isolating the 1491 -bp fragment 
containing the complete protox coding sequence, digesting the rps16P_1a and rps16P_lb 
PCR product described above with HindlU, and ligating these to a 7465-bp Ncol/Hindlll 
fragment of pPH144. 

Example 39: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and 5' Untranslated Sequence Fused to the EPSP Synthase Coding Sequence 
and Plastid rps16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid 

Transformation 

A cDNA library Is screened for the 5-enolpyruvyl-3-phosphoshikimate synthase 
(EPSP synthase) gene (U.S. Patent Nos. 5,310,667, 5,312,910, and 5,633,435, all 
incorporated herein by reference). A plasmid clone containing the full length EPSP 
synthase gene cDNA is Isolated by standard techniques of molecular cloning. PCR primers 
are designed for amplification of the mature-size EPSP synthase coding sequence from this 
plasmid using a top strand primer having a 5' extension containing an A/co/ restriction site 
inserted at amino acid -1 from the deduced mature protein start, thus creating an ATG start 
codon at this position, and a bottom strand primer having a 5* extension containing an Xbal 
restriction site downstream of the stop codon of the EPSP mature coding sequence in the 
amplified PCR product. The PCR amplification is performed using the designated primers 
and plasmid DNA template according to standard protocols. Amplified products are cloned 
and sequenced and a Ncol-Xbal DNA fragment containing the complete mature EPSP 
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synthase coding sequence is isolated by restriction digest with Ncol and Xbal, 
electrophoresis on a 0.8% TAE agarose gel, and phenol extraction of the excised band. 

A plastid transfonmation vector containing the c/pP promoter directing transcription of 
the mature-sized EPSP synthase gene with the ips163' UTR Is constructed by digesting 
pPH140 with Ncol and Xbal and purifying the fragment containing the vector backbone, 5' 
and 3' plastid integration targeting sequences, aadW selectable marker cassette, and cIpP 
promoter / /psTS 3' UTR expression sequences. This product is ligated in a two-way 
reaction with the Ncol-Xbal DNA fragment containing the mature-sized EPSP synthase 
coding sequence isolated as described above. 

Example 40: Preparation of a Chimeric Gene Containing the Tobacco Plastid cIpP Gene 
Promoter and 5' Untranslated Sequence Fused to the ALS Coding Sequence and Plastid 
rps16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid Transformation 

A cDNA library is screened for the acetolactate synthase (ALS) gene (U.S. Patent 
No. 5,013,659). A plasmid clone containing the full length ALS gene cDNA is isolated by 
standard techniques of molecular cloning. PCR primers are designed for amplification of the 
mature-size ALS coding sequence from this plasmid using a top strand primer having a 5' 
extension containing an Ncol restriction site inserted at amino acid -1 from the deduced 
mature protein start, thus creating an ATG start codon at this position, and a bottom strand 
primer having a 5' extension containing an Xbal restriction site downstream of the stop 
codon of the ALS mature coding sequence in the amplified PCR product. The PCR 
amplification is perfomied using the designated primers and plasmid DNA template 
according to standard protocols. Amplified products are cloned and sequenced and a Ncol- 
Xbal DNA fragment containing the complete mature ALS coding sequence is isolated by 
restriction digest with Ncol and Xbal, electrophoresis on a 0.8% TAE agarose gel, and 
phenol extraction of the excised band. 

A plastid transfomiation vector containing the c/pP promoter driving the mature-sized 
ALS gene with the rps16 3' UTR is constaicted by digesting pPH1 40 with Ncol and Xbal 
and purifying the fragment containing the vector backbone, 5' and 3' plastid integration 
targeting sequences, aadA selectable maricer cassette, and cIpP promoter / rpsIB 3' UTR 
expression sequences. This product is ligated in a two-way reaction with the Ncol-Xbal 
DNA fragment containing the mature-sized ALS coding sequence isolated as described 
above. 
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Example 41 : Preparation of a Chimeric Gene Containing tlie Tobacco Plastid cIpP Gene 
Promoter and 5' Untranslated Sequence Fused to tlie AHAS Coding Sequence and Plastid 
rp$16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid Transformation 

A cDNA library is screened for the acetohydroxyacid synthase (AHAS) gene (U.S. 
Patent No. 4,761 .373). A plasmid clone containing the full length AHAS gene cDNA is 
isolated by standard techniques of molecular cloning. PCR primers are designed for 
amplification of the mature-size AHAS coding sequence from this plasmid using a top 
strand primer having a 5' extension containing an Ncol restriction site inserted at amino acid 
-1 from the deduced mature protein start, thus creating an ATG start codon at this position, 
and a bottom strand primer having a 5' extension containing an Xbal restriction site 
downstream of the stop codon of the AHAS mature coding sequence in the amplified PCR 
product. The PCR amplification is perfonmed using the designated primers and plasmid 
DNA template according to standard protocols. Amplified products are cloned and 
sequenced and a NcohXbal DNA fragment containing the complete mature AHAS coding 
sequence is isolated by restriction digest with Ncol and Xbal, electrophoresis on a 0.8% 
TAE agarose gel, and phenol extraction of the excised band. 

A plastid transfomiation vector containing the c/pP promoter driving the mature-sized 
AHAS gene with the rps16 3' UTR is constmcted by digesting pPHI 40 with Ncol and Xbal 
and purifying the fragment containing the vector backbone, 5' and 3" plastid integration 
targeting sequences, aadA selectable marker cassette, and c/pP promoter / rps163' UTR 
expression sequences. This product is ligated in a two-way reaction with the Ncol-Xbal 
DNA fragment containing the mature-sized AHAS coding sequence isolated as described 
above. 

Example 42: Preparation of a Chimeric Gene Containing the Tobacco Plastid c/pP Gene 
Promoter and 5' Untranslated Sequence Fused to the ACCase Coding Sequence and 
Plastid rps16 Gene 3' Untranslated Sequence in a Vector for Tobacco Plastid 
Transformation 

A cDNA library is screened for the acetylcoenzyme A carboxylase (ACCase) gene 
(U.S. Patent No. 5,162,602). A plasmid clone containing the full length ACCase gene 
cDNA is isolated by standard techniques of molecular cloning. PCR primers are designed 
for amplification of the mature-size ACCase coding sequence from this plasmid using a top 
strand primer having a 5' extension containing an Ncol restriction site inserted at amino acid 
-1 from the deduced mature protein start, thus creating an ATG start codon at this position, 
and a bottom strand primer having a 5' extension containing an Xbal restriction site 
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downstream of the stop codon of the ACCase mature coding sequence in the amplified 
PGR product. The PGR amplification is perfonmed using the designated primers and 
plasmid DNA template according to standard protocols. Amplified products are cloned and 
sequenced and a Ncol-Xbal DNA fragment containing the complete mature AGCase 
coding sequence is isolated by restriction digest with Ncol and Xbal, electrophoresis on a 
0.8% TAE agarose gel, and phenol extraction of the excised band. 

A plastid transfomnation vector containing the cIpP promoter driving the mature-sized 
AGGase gene with the rp$16 3' UTR is constmcted by digesting pPH140 with Ncol and 
Xbaland purifying the fragment containing the vector backbone, 5' and 3' plastid integration 
targeting sequences, aacM selectable marker cassette, and c/pP promoter / ip$163' UTR 
expression sequences. This product is ligated in a two-way reaction with the Ncol-Xbal 
DNA fragment containing the mature-sized AGGase coding sequence isolated as described 
above. 

Example 43: Biolistic Transfomiation of the Tobacco Plastid Genome 
Seeds of Nicotiana tabacum c.v. 'Xanthi nc' were germinated seven per plate in a 1" 
circular array on T agar medium and bombarded 12-14 days after sowing with 1 /im 
tungsten particles (M10, Biorad, Hercules, GA) coated with DNA from plasmids pPH143 and 
pPH145 essentially as described in Svab, Z. and Maliga, P. (1993) PNAS 90, 913-917. 
Bombarded seedlings were incubated on T medium for two days after which leaves were 
excised and placed abaxial side up in bright light (350-500 pmol photons/m^/s) on plates of 
RMOP medium (Svab, Z, Hajdukiewicz, P. and Maliga, P. (1990) PNAS 87, 8526-8530) 
containing 500 pg/ml spectlnomycin dihydrochloride (Sigma. St. Louis, MO). Resistant 
shoots appearing underneath the bleached leaves three to eight weeks after bombardment 
were subcloned onto the same selective medium, allowed to form callus, and secondary 
shoots isolated and subcloned. Gomplete segregation of transformed plastid genome 
copies (homoplasmlcity) in independent subclones was assessed by standard techniques of 
Southern blotting (Sambrook etal., (1989) Molecular Cloning: A Laboratory Manual, Gold 
Spring Harbor Laboratory, Gold Spring Harbor). BamHI/EcoRMlgesXedi total cellular DNA 
(Mettler, I. J. (1 987) Plant MolBlol Reporters, 346-349) was separated on 1% Tris-borate 
(TBE) agarose gels, transferred to nylon membranes (Amersham) and probed with ^P- 
labeled random primed DNA sequences corresponding to a 0.7 kb BamHI/Hindlll DNA 
fragment from pG8 containing a portion of the rps7/12 plastid targeting sequence. 
Homoplasmic shoots are rooted aseptically on spectihomycin-containing MS/IBA medium 
(McBride. K. E. etal. (1994) PAMS 91. 7301-7305) and transferred to the greenhouse. 
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Example 44: Assessment of Herbicide Tolerance in Nt_pPH143 and Ntj)PH145 Plastid 

Transfonmant Lines 

Primary homoplasmic transformant lines transformed with pPH143 (line NtjDPH143) 
or with pPH145 (line Nt„pPH145), which were obtained as described in Example 43, were 
grown to maturity in the greenhouse. Flowers were either: (a) self-pollinated, (b) pollinated 
with wildtype tobacco (c.v. Xanthi nc), or (c) used as pollen donors to fertilize emasculated 
flowers of wildtype Xanthi plants. Plastid segregation of the linked spectinomycin 
resistance marker was verified by uniparental female inheritance of the spectinomycin- 
resistance phenotype in each transformant line using a minimum of 50 seeds per selection 
pool derived from either selfed or backcross capsules. Additional self or wildtype backcross 
(Xanthi pollen parent) seeds were genninated in soil. 36 plants of each line (143 1B-1, 143 
1B-4, 143 4A-2. 143 4A-5, 145 7A-5, 145 7A-6, 145 8A-3) plus 36 wildtype Xanthi plants as 
isogenic controls were grown in separate 6" clay pots in a controlled environment cubicle. 
In order to assess tolerance to the protox inhibitor Formula XVII, plants of Xanthi and the 
seven transfomnant lines were distributed Into eight identical 16-pot flats (2 plants of each 
type per flat). The flats were sprayed with Fonmula XVII until runoff at concentrations of 
either 0. 0.5, 2.5, 5, 10, 25, 50, or 100 mg Formula XVII per liter. Solutions were made up 
in water using 4 g/liter or 40 g/liter stock solutions of Fomiula XVII dissolved in 
dimethylsulfoxide (DMSO) and used immediately after preparation. Twenty microliters of 
the wetting agent Silwett was added to each 200 ml volume of hert>icide solution for a final 
concentration of 0.01%. Flats were sprayed in the late aftemoon and allowed to dry 
overnight before transfer to the growth cubicle. Tolerance was assesed by comparing leaf 
damage and wilting to the untransfonmed Xanthi controls at 0, 18 hrs, 48 hrs, and 6 days 
post-application. Severe damage was apparent on the Xanthi plants at all concentrations 
above 0.5 mg/l, and complete wilting/bum down occunred above 2.5 mg/l. Only slight 
damage occunred on the Nt_pPH143 plants even at the highest concentration (100 
mg/liter), and the plants soon outgrew the bleached spots (the appearance of Xanthi at 0.5 
mg/liter was approximately equivalent to Nt_pPH143 1B-1 at 100 mg/liter. giving a tolerance 
of ca. 200-fold). 

Example 45: Plastid Transformation of Maize 
Type I embryogenic callus cultures (Green ef a/. (1983) in A. Fazelahmad, K. 
Downey, J. Schultz, R.W. Voellmy, eds. Advances in Gene Technology: Molecular 
Genetics of Plants and Animals. Miami Winter Symposium Series, Vol. 20. Academic Press, 



103 



wo 01/68826 PCT/EPOl/02753 

N.Y.) Of the proprietary genotypes CG00526 and CG00714 are initiated from immature 
embryos, 1.5 - 2.5 mm in length, from greenhouse grown material. Embryos are aseptically 
excised from surface-sterilized ears approximately 14 days after pollination. Embryos of 
CG00526 are placed on D callus initiation media with 2% sucrose and 5mg/L chloramben 
(Duncan etaL (1985) Plants 165: 322-332) while those of CG00714 are placed onto KM 
callus initiation media with 3% sucrose and 0.75mg/L 2,4-d (Kao and Michayluk (1975) 
Planta 126, 105-1 10). Embryos and embryogenic cultures are subsequently cultured in the 
dari<. Embryogenic responses are removed from the explants after -14 days. CG00526 
responses are placed onto D callus maintenance media with 2% sucrose and 0.5mg/L 2.4-d 
while those of CG00714 are placed onto KM callus maintenance media with 2% sucrose 
and 5mg/L Dicamba. After 3 to 8 weeks of weekly selective subculture to fresh 
maintenance media, high quality compact embryogenic cultures are established. Actively 
growing embryogenic callus pieces are selected as target tissue for gene delivery. The 
callus pieces are plated onto target plates containing maintenance medium with 12% 
sucrose approximately 4 hours prior to gene delivery. The callus pieces are arranged in 
circles, with radii of 8 and 10mm from the center of the target plate. Plasmid DNA is 
precipitated onto gold microcarriers as described in the DuPont Biolistics manual. Two to 
three jig of each plasmid is used in each 6 shot microcarrier preparation. Genes are 
delivered to the target tissue cells using the PDS-IOOOHe Biolistics device. The settings on 
the Biolistics device are as follows: 8 mm between the rupture disc and the macrocarrier, 
10 mm between the macrocarrier and the stopping screen and 7 cm between the stopping 
screen and the target. Each target plate is shot twice using 650psl rupture discs. A 200 X 
200 stainless steel mesh (McMaster-Carr, New Brunswick, NJ) is placed between the 
stopping screen and the target tissue. 

Five days later, the bombed callus pieces are transferred to maintenance medium 
with 2% sucrose and 0.5mg/L 2,4-d, but without amino acids, and containing 750 or 1000 
nM Formula XVII. The callus pieces are placed for 1 hour on the light shelf 4-5 hours after 
transfer or on the next day, and stored in the daric at 27''C for 5-6 weeks. Following the 5-6 
week primary selection stage, yellow to white tissue is transfenred to fresh plates containing 
the same medium supplemented with 500 or 750 nM Fonmula XVII. 4-5 hours after transfer 
or on the next day, the tissues are placed for 1 hour on the light shelf and stored in the dark 
at 27°C for 3-4 weeks. Following the 3-4 week secondary selection stage, the tissues are 
transferred to plates containing the same medium supplemented with 500 nM Formula XVII. 
Healthy growing tissue is placed for 1 hour on the light shelf and stored in the dari^ at 27**C. 
It is subcultured every two weeks until the colonies are large enough for regeneration. 
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At that point, colonies are transferred to a modified MS medium (Murashige and 
Skoog (1962) Pliysiol. Plant 15: 473-497) containing 3% sucrose (MS3S) with no selection 
agent and placed in the light. For CG00526. 0.25mg/L ancymidol and 0.5mg/L kinetin are 
added to this medium to induce embryo genrnination, while for CG00714, 2mg/L benzyl 
adenine is added. Regenerating colonies are transfen-ed to MS3S media without ancymidol 
and kinetin, or benzyl adenine, for CG00526 or CG00714, respectively, after 2 weeks. 
Regenerating shoots with or without roots are transferred to boxes containing MS3S 
medium and small plants with roots are eventually recovered and transferred to soil in the 
greenhouse. 

Example 46: Determination of the NH2-terminus of the mature maize 
protoporphyrinogen oxidase 

Maize seedlings are grown 6 days at 27'C In the dark and illuminated 30 min prior to 
recovering etioplasts. Leaves are homogenised in extraction buffer (20 mM TES, 10 mM 
Hepes, 0.5 M sucrose. 1 mM MgCIa, 1 mM EDTA, 0.2% BSA, pH 7.7, ±5 mM DTT) and 
filtered through cheesecloth layers. After centrifugation (7 min x lOOOOg) the pellet is 
resuspended in the same buffer and centrifuged 4 min x 150g. The supernatant is 
centrifuged 10 min x 6000g, and the pellet resuspended in a small amount of buffer. 

For Immunopreclpitation, the chloroplast fraction is diluted 5 fold in 20mM K- 
phosphate buffer, pH 7.0, 0.2 % Triton X-100, put on ice for 30 min and sonicated (3x6 
sec) before being mixed with Protein A Sepharose CL-4B (Pharmacia). The supernatant is 
recovered by centrifugation and incubated with Protein A Sepharose CL-4B coupled to 
purified anti- protoporphyrinogen oxidase antibodies (anti- protox-1). The resin is washed 
with 20 mM Na-Phosphate buffer, pH 7.0, and specifically bound proteins are eluted by 
boiling in 3% SDS. Detergent is precipitated at 4^C, the supernatant lyophilised by speed- 
vacuum and resuspended in 20 mM Na-Phosphate buffer, pH 7.0. 

Proteins are separated by SDS-electrophoresis according to Laemmli (1970) Nature 
(London) 227, 680-685, and silver stained using a Blorad (Hercules, CA) kit. The proteins 
separated by SDS-electrophoresis are blotted onto nitrocellulose (Milllpore). Purified anti- 
protoporphyrinogen oxidase antibodies are diluted 1:15,000 before use. Anti rabbit IgG 
alkaline phosphatase conjugates (Sigma) are used as secondary antibodies and 
protoporphyrinogen oxidase bands are stained by using a nitroblue tetrazolium/ 5-bromo-4- 
chloro-3-indolyl phosphate kit (Biorad). 

Protein samples for NHa-sequencing are prepared as previously described (de 
Marco, A., Guzzardi, P., & Jamet, E. (1999) Plant Physiol. 120, 371-381), and Edman 
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analysis is carried out in a model 477A protein sequencer (Applied Biosystems Inc., Foster 
City, CA) according to the recommendations of the manufacturer. The NH2-terminus of the 
mature maize protoporphyrinogen oxidase (protox-1) is found to be located at the alanine at 
position 1 in the amino acid sequence set forth in SEQ ID NO:45. 

Purified recombinant protoporphyrinogen oxidase (see e.g. this application) is used 
to raise antibodies in rabbit using methods well known in the art. The antibodies are 
purified from sera using EMDTA Fractogel (Merck). 

Example 47: Recombinant expression of the mature maize protoporphyrinogen oxidase 

A. Construction of expression vectors and cell transformation. 

The pET16b and pET28 bacterial expression vector (Novagen, Madison, USA), 
based on the T7 promoter driven system and containing a 5* His-tag is used. The coding 
sequence of the mature maize protox-1 is amplified by PCR. The PCR product is cloned 
into the BamHI site of pET16b. A plasmid containing the con-ect orientation of the insert is 
identified by sequencing and used for transfonmation of competent BL21 (DE3) and BI-21 
(DL3)pLysS cells. 

B. Recombinant protein isolation. 

E. CO// cells, transfomied with the vector hosting the mature maize 
protoporphyrinogen oxidase coding sequence, are cultured 16 h at 36*'C under constant 
agitation. Isopropyl-P-D-thiogalactopyranoside (iPTG) is added to the medium 2 h before 
bacterial recovery. Alternative growth conditions are also used to increase the accumulation 
of active protein. Rrst, induction of heat-shock proteins Is stimulated by culturing the 
bacteria initially at SO^'C and then increasing to 42**C in the presence of 3% ethanol. 
Second, the uptake of the compatible solute giycyl betaine is performed using sorbitol as an 
osmotic. Bacterial cells are recovered by centrifugation (15 min x 7500gf) and stored at - 
80°C. The bacterial cells are resuspended in extraction buffer (40 mM Na-Phosphate, pH 
7.2, 300 mM NaCI. 20% glycerol, 0.2% Triton X-100, 20 mM imidazole), sonicated (6x6 
sec), and then centrifuged (25 min x lOOOOOg). The supematant is mixed with 2 mL of pre- 
washed immobilized nickel affinity chromatography resin (Talon, Clontech Lab.) in a 50 mL 
Falcon tube and gently rocked for 1 0 min, at 4*'C. After centrifugation (2 min x 700 rpm), the 
supematant is removed and the resin washed in extraction buffer (2X). Different protocols 
are used to complete the protein purification and described below. 

Method 1): The resin is used to load a column and subsequently washed with 10 
volumes of washing buffer (50 mM EPPS, pH 7.2, 20 % glycerol, 0.1% TritonX -100). Bound 
proteins are eluted with the washing buffer containing 100 mM EDTA, desalted and buffer 
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exchanged (50 mM EPPS. pH 8.5, 1 mM EDTA) using PD-10 Sephadex G-25 columns 
(Phamnacia), and loaded onto an ion-exchange iVIono-Q FPLC column (Phanmacia). 
Proteins are eluted using a 0-1 M NaCI gradient. Fractions conresponding to the different 
proteins are collected separately, concentrated by speed-vacuum and dialysed against 
washing buffer. Alternatively, fractions recovered by affinity chromatography are buffer 
exchanged with 20 mM EPPS, pH 7.2, 150 mM NaCI and are loaded (- 150 ^iL 
con^esponding to 150 ng protein) onto a pre-equilibrated gel-filtration Superdex 75 column 
(Pharmacia). 

Method 2): Proteins are eluted as in Method 1), buffer exchanged against washing 
buffer, loaded onto a column, washed twice in ATP buffer (50 mM EPPS, pH 7.2, 20 % 
glycerol, 5 mM ATP. 7 mM MgCI2, 0.1% TritonX -100) and eluted with imidazole buffer (50 
mM EPPS. pH 7.2, 20% glycerol. 0.1% TritonX -100, 1 M imidazole). 

Method 3): After the initial extraction buffer washing steps, the metal affinity resin is 
resuspended In ATP buffer and agitated for 30 min at 4 "'C. The resin is recovered by 
centrifugation and washed again in the ATP buffer before being loaded onto a column and 
re-equilibrated with 50 mM EPPS, pH 7.2, 20% glycerol, 0.1% TritdnX -100. Protein is 
eluted with imidazole buffer (described above). In some extractions, phospholipids (Sigma, 
St. Louis) were added to the elution buffer (L-a-phosphatldylcholine; L-a- 
phosphatidylcholine, dioleoyi; L-a-phosphatidyl-ethanolamine; L-a-phosphatidyl-L-serine; L- 
a-phosphatidylinositol; L-a-IysophosphatidylinosltoI; L-a-lysophosphatidylcholine; L-a- 
lysophosphatidyl-L-serine; L-a-lysophosphatidylethanolamine). 

Resuspension of heat denaturated protein and of protein from homogenate pellet Is 
performed using extraction buffer or re-equilibration buffer containing either phospholipides 
or chaperones. Proteins not immediately used for enzymatic tests are stored at -80 °C. The 
His-tag is removed by incubation in the presence of Factor Xa (New England Biolabs), 
according to manufacturer's instructions. 



Table 1 A 

Alignment of the full-length and partial protox-1 amino acid sequences from 
Arabidopsis ("Arabpt-1 SEQ ID NO:2), maize ("Mzpt-I"; SEQ ID NO:6). wheat ("Wtpt-I"; 
SEQ ID NO:10). soybean ("Soybeanpt-I"; SEQ ID NO:12), cotton ("Cottonpt-I"; SEQ ID 
N0:16). sugar beet ("Sugpt-1"; SEQ ID N0:18), oilseed rape ("Rapept-I"; SEQ ID NO:20), 
rice ("Ricept-r; SEQ ID NO:22). sorghum ("Sorghumpt-I"; SEQ ID NO:24). and sugar cane 
("Scpt-r; SEQ ID NO:37). Alignment was performed using the PileUp program (GCG 
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package, University of Wisconsin, l\/Iadison, Wl). Positions that may be modified according 
to the teachings herein to confer or enhance inhibitor resistance are shown in bold type. 



1 50 

Rapept--! MDLSLLRP. . QPFLSPFSNP FPRSRPYKPL 

Arabpt-1 MELSLLRPTT QSLLPSFSKP NLRLNVYKPL 

Sorghumpt-l 

Mzpt-1 

Wtpt-1 M ATATVAAASP LRGRVTGRPH 

Ricept-1 

Cottonpt-1 MTAL IDLSLLRSSP SVSPFSIPHH QHPPRFRKPF 

Soybeanptl MV SVFNEILFPP NQTLLRPSLH SPTSFFTSPT RKFPRSRPNP 

Sugpt-1 MKSMALSNCI PQTQCMPLRS SGHYRGNCIM LSIPCSLIGR RGYYSHKKRR 

Scpt-1 

51 100 

Rapept-1 NLRCSVSGGS WGSSTIEGG GGGKTVTADC VIVGGGISGL CIAQALVTKH 

Arabpt-1 RLRCSVAGGP TVGSSKIEGG GGT^TITTDC VIVGGGISGL CIAQALATKH 

Sorghvunpt-l 

Mzpt-l ADC VWGGGISGL CTAQALATRH 

Wtpt-1 RVRPRCATAS SATETPAAPG VRL...SAEC VIVGAGISGL CTAQALATRY 

Ricept-1 

Cottonpt-1 KLRCSLAEGP TISSSKIDGG ESS...IADC VIVGGGISGL CIAQALATKH 

Soybeanptl ILRCSIAEES TASPPKTR. . DSA. . .PVDC VWGGGVSGL CIAQALATKH. 

Sugpt-1 MSMSCSTSSG SKSAVKEAGS GSGAGGLLDC VIVGGGISGL CIAQALCTKH 

Scpt-1 
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101 150 

Rapept--! PDA. .AKNVM VTEAKDRVGG NIIT. .REEQ GFLWEEGPNS FQPSDPMLTM 

Arabpt-1 PDA. .APNLI VTEAKDRVGG NIIT. .REEN GFLWEEGPNS FQPSDPMLTM 

Sorghxunpt-l STVERPEE GYLWEEGPNS FQPSDPVLSM 

Mzpt-1 ..G..VGDVL VTEARARPGG NITTVERPEE GYLWEEGPNS FQPSDPVLTM 

Wtpt-1 ..G..VSDLL VTEARDRPGG NITTVERPDE GYLWEEGPNS FQPSDPVLTM 

Ricept-1 

Cottonpt-1 RDV..ASNVI VTEARDRVGG NITTVER. .D GYLWEEGPNS FQPSDPILTM 

Soybeanptl . .A. .NANW VTEARDRVGG NITTMER. .D GYLWEEGPNS FQPSDPMLTM 

Sugpt-1 SSSSLSPNFI VTEAKDRVGG NIVTVE. .AD GYIWEEGPNS FQPSDAVLTM 

Scpt-1 



Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-l 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



151 

WDSGLKDDL 
WDSGLKDDL 
AVDSGLKDDL 
AVDSGLKDDL 
AVDSGLKDDL 



VLGDPTAPRF 
VLGDPTAPRF 
VFGDPNAPRF 
VFGDPNAPRF 
VFGDPNAPRF 



VLWNGKLRPV 
VLWNGKLRPV 
VLWEGKLRPV 
VLWEGKLRPV 
VLWEGKLRPV 



PSKLTDLPFF 
PSKLTDLPFF 
PSKPADLPFF 
PSKPADLPFF 
PSKPGDLPFF 



200 

DLMSIGGKIR 
DLMSIGGKIR 
DLMSIPGKLR 
DLMSIPGKLR 
SLMSIPGKLR 



AVDSGLKDDL VLGDPNAPRF VLWEGKLRPV PSKPTDLPFF DLMSIAGKLR 
WDSGLKDEL VLGDPDAPRF VLWNRKLRPV PGKLTDLPFF DLMSIGGKIR 
AVDSGLKDEL VLGDPNAPRF VLWNDKLRPV PSSLTDLPFF DLMTIPGKIR 



Rapept-1 
Arabpt-1 
Scrghtimpt-l 
Mzpt-l 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



201 

AGFGAIGIRP 
AGFGALGIRP 
AGLGALGIRP 
AGLGALGIRP 
AGLGALGIRP 



SPPGREESVE 
SPPGREESVE 
PAPGREESVE 
PPPGREESVE 
PPPGREESVE 



EFVRRNLGDE 
EFVRRNLGDE 
EFVRRNLGAE 
EFVRRNLGAE 
EFVRRNLGAE 



VFERLIEPFC 
VFERLIEPFC 
VFERLIEPFC 
VFERLIEPFC 
VFERLIEPFC 



250 

SGVYAGDPAK 
SGVYAGDPSK 
SGVYAGDPSK 
SGVYAGDPSK 
SGVYAGDPSK 



AGFGAIGIRP PPPGYEESVE EFVRRNLGAE VFERFIEPFC SGVYAGDPSK 
AGFGALGIRP PPPGHEESVE EFVRRNLGDE VFERLIEPFC SGVYAGDPSK 
AALGALGFRP SPPPHEESVE HFVRRNLGDE VFERLIEPFC SGVYAGDPAK 



Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-l 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



251 

LSMKAAFGKV 
LSMKAAFGKV 
LSMKAAFGKV 
LSMKAAFGKV 
LSMKAAFGKV 
RALKAAFGKV 
LSMKAAFGRV 
LSMKAAFGKV 
LSMKAAFGKV 



WKLEENGGSI 
WKLEQNGGSI 
WRLEEAGGSI 
WRLEETGGSI 
WRLEEIGGSI 
WRLEDTGGSI 
WKLEEIGGSI 
WKLEKNGGSI 
WKLEQKGGSI 



IGGAFKAIQA 
IGGTFKAIQE 
IGGTIKTIQE 
IGGTIKTIQE 
IGGTIKAIQD 
IGGTIKTIQE 
IGGTFKTIQE 
IGGTFKAIQE 
IGGTLKAIQE 



KNKAPKTTRD 
RKNAPKAERD 
RGKNPKPPRD 
RSKNPKPPRD 
KGKNPKPPRD 
RGKNPKPPRD 
RNKTPKPPRD 
RNGASKPPRD 
RGSNPKPPRD 



300 

PRLPKPKGQT 
PRLPKPQGQT 
PRLPKPKGQT 
ARLPKPKGQT 
PRLPAPKGQT 
PRLPTPKGQT 
PRLPKPKGQT 
PRLPKPKGQT 
QRLPKPKGQT 
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Rapept-1 
Arabpt-1 
Sorghximpt-l 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-1 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



301 

VGSFRKGLTM 
VGSFRKGLRM 
VASFRKGLAM 
VASFRKGLAM 
VASFRKGLAM 
VASFRKGLTM 
VGSFRKGLTM 
VGSFRKGLTM 
VGSFRKGLVM 



LPEAISARLG 
LPKAISARLG 
LPNAITSSLG 
LPNAITSSLG 
LPNAIASRLG 
LPDAITSRLG 
LPEAIANSLG 
LPDAISARLG 
LPTAISARLG 



DKVKVSWKLS 
SKVKLSWKLS 
SKVKLSWKLT 
SKVKLSWKLT 
SKVKLSWKLT 
SKVKLSWKLT 
SNVKLSWKLS 
NKVKLSWKLS 
SRVKLSWTLS 



SITKLASGEY 
GITKLESGGY 
SMTKSDGKGY 
SITKSDDKGY 
SITKADNQGY 
SITKSDNKGY 
SITKLGNGGY 
SISKLDSGEY 
SIVKSLNGEY 



350 

SLTYETPEGI 
NLTYETPDGL 
VLEYETPEGV 
VLEYETPEGV 
VLGYETPEGL 
ALVYETPEGV 
NLTFETPEGM 
SLTYETPEGV 
SLTYDTPDGL 



351 

VTVQSKSWM 
VSVQSKSWM 
VLVQAKSVIM 
VSVQAKSVIM 
VSVQAKSVIM 
VSVQAKTWM 
VSLQSRSWM 
VSLQCKTWL 
VSVRTKSWM 



TVPSHVASSL 
TVPSHVASGL 
TIPSYVASDI 
TIPSYVASNI 
TIPSYVASDI 
TIPSYVASDI 
TIPSHVASNL 
TIPSYVASTL 
TVPSYVASRL 



LRPLSDSAAE 
LRPLSESAAN 
LRPLSGDAAD 
LRPLSSDAAD 
LRPLSIDAAD 
LRPLSSDAAD 
LHPLSAAAAD 
LRPLSAAAAD 
LRPLSDSAAD 



ALSKLYYPPV 
ALSKLYYPPV 
VLSRFYYPPV 
ALSRFYYPPV 
ALSKFYYPPV 
ALSIFYYPPV 
ALSQFYYPPV 
ALSKFYYPPV 
SLSKFYYPPV 



400 

AAVSISYAKE 
AAVSISYPKE 
AAVTVSYPKE 
AAVTVSYPKE 
AAVTVSYPKE 
T^VTVSYPKE 
ASVTVSYPKE 
AAVSISYPKE 
AAVSLSYPKE 



401 

AIRSECLIDG 
AIRTECLIDG 
AIRKECLIDG 
AIRKECLIDG 
AIRKECLIDG 
AIRKECLIDG 
AIRKECLIDG 
AIRSECLIDG 
AIRSECLING 



ELKGFGQLHP 
ELKGFGQLHP 
ELQGFGQLHP 
ELQGFGQLHP 
ELQGFGQLHP 
ELQGFGQLHP 
ELKGFGQLHP 
ELKGFGQLHP 
ELQGFGQLHP 



RTQKVETLGT 
RTQGVETLGT 
RSQGVETLGT 
RSQGVETLGT 
RSQGVETLGT 
RSQGVETLGT 
RSQGIETLGT 
RSQGVETLGT 
RSQGVETLGT 



lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPNR 
lYSSSLFPGR 



450 

APPGRVLLLN 
APPGRILLLN 
APAGRVLLLN 
APDGRVLLLN 
APAGRVLLLN 
APAGRVLLLN 
APSGRVLLLN 
APPGRVLLLN 
APPGRILILS 



451 

YIGGATNTGI 
YIGGSTNTGI 
YIGGATNTGI 
YIGGATNTGI 
YIGGSTNTGI 
YIGGSTNTGI 
YIGGATNTGI 
YIGGATNTGI 
YIGGAKNPGI 



LSKSEGELVE 
LSKSEGELVE 
VSKTESELVE 
VSKTESELVE 
VSKTESDLVG 
VSKTESELVE 
LSKTEGELVE 
LSKTDSELVE 
LNKSKDELAK 
. SKTESELVE 



AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
AVDRDLRKML 
TVDRDLRKIL 
TVDKDLRRML 
AVDRDLRKML 



IKPSSTDPLV 
IKPNSTDPLK 
INPTAVDPLV 
INSTAVDPLV 
INPRAADPLA 
INPRAVDPLV 
INPNAKDPLV 
INPNAQDPFV 
INPDAKLPRV 
INPTAVDPLV 



500 

LGVKLWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
LGVRVWPKAl 
VGVRLWPQAI 
LGVRVWPQAI 
LGVRVWPQAI 
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Rapept-1 
Arabpt-1 
Sorghumpt-l 
Mzpt-l 
Wtpt-1 
Ricept-1 
Cottonpt-1 
Soybeanptl 
Sugpt-1 
Scpt-1 



501 

PQFLIGHIDL 
PQFLVGHPDI 
PQFLVGHLDL 
PQFLVGHLDL 
PQFLIGHLDR 
PQFLIGHLDH 
PQFLVGHLDL 
PQFLVGHLDL 
PQFSIGHFDL 
PQFLVGHLDL 



VDAAKASLSS 
LDTAKSSLTS 
LEAAKSALDQ 
LEAAKAALDR 
LAAAKSALGQ 
LEAAKSALGK 
LDSAKMALRD 
LDVAKASIRN 
LDAAKAALTD 
LEAAKSTUJDR 



SGHEGLFLGG 
SGYEGLFLGG 
GGYNGLFLGG 
GGYDGLFLGG 
GGYDGLFLGG 
GGYDGLFLGG 
SGFHGLFLGG 
TGFEGLFLGG 
TGVKGLFLGG 
GGYDGLFLGG 



NYVAGVALGR 
NYVAGVALGR 
NYVAGVALGR 
NYVAGVALGR 
KYVAGVALGR 
NYVAGVALGR 
NYVSGVALGR 
NYVSGVALGR 
NYVSGVALGR 
NYVAGVALGR 



550 

CVEGAYETAT 
CVEGAYETAI 
CIEGAYESAA 
CVEGAYESAS 
CIEGAYESAS 
CVEGAYESAS 
CVEGAYEVAA 
CVEGAYEVAA 
CIEGAYESAA 
CVEGAYESAS 







551 


563 


Rapept- 


1 


QVNDFMSRYA 


YK* 


Arabpt- 


1 


EVNNFMSRYA 


YK* 


Sorghumpt- 


•1 


QIYDFLTKYA 


YK* 


Mzpt- 


1 


QISDFLTKYA 


YK* 


Wtpt- 


1 


QVSDFLTKYA 


YK* 


Ricept- 


1 


QISDYLTKYA 


YK* 


Cottonpt- 


•1 


EVKEFLSQYA 


YK* 


Soybeanptl 


EVNDFLTNRV 


YK* 


Sugpt- 


1 


EWDFLSQYS 


DK* 


Scpt- 


1 


QIYDFLTKYA 


YK* 



Table IB 

Sub-sequences of herbicide-tolerant protox enzymes comprising point mutations. 



Sub- 
sequence 
# 


Sub- 
sequence AA 
sequence 


AnAA 

wild-type 


AnAA 

substitutions 


Corresponding 
AA position of 
An in Table lA 


Exemplary 
mutants 


1 


APAiF 


R 


C.L 


169 


Mz88Cys 
Mz88Leu 


2 


FAaS 


C 


F. L,K 


240 


M2l59Phe 
Mz159Leu 
Mz159Lvs 


3 


YAgG 


A 


V.T, L, CI 


245 


pAraC-1Val 

pAraC-IThr 

pAraC-1 Leu 

pAraC-1 Cys 

pAraC-llle 

pMzC-1Val 

pMzC-1Thr 

pMzC-1 Leu 

pWhtC-1Val 

pWhtC-1Thr 

pSoyC-1Thr 

pSoyC-ILeu 
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A 

4 


AA4D 


G 


S, L 


246 


pAraC-3Ser 
p ivi£ w~o oe r 
pMzC-SLeu 
pWhtC-aSer 


5 


YA5P 


P 


S.H 


388 


Soy369Ser 
Soy369His 
Cot365Ser 


6 


PAfiA 


V 


L 


390 


Wht356Leu 


7 


A7IG 


Y 


C, 1. L, T. IVl. 
V, A, H, R 


451 


pAraC-2Cys 

pAraC-2lle 

pAraC-2Leu 

pAraC-2Thr 

pAraC-2Met 

pAraC-2VaI 

pAraC-2Ala 

pMzC-2lle 

pM2C-2Met 

pSoyC-2Leu 

pSoyC-2lle 

pSugC-2Cys 

pSugC-2Leu 

pougc-2iie 

pSugC-2Val 

pSugC-2Met 

pCotC-2Cys 

pCotC-2His 

pooio-^Aig 


8 


YIGGAs 


A, S 


P 


455 


Wiit421 Pro 


n 
V 


AA9P 


1 
1 


T, H, G, N 


500 


lVlz419Thr 
l\4z4igHis 

M2419Gly 
Mz419Asn 

WhMRRThr 


10 


G^ioA 


V 


A 


536 


Wht502Ala 
Soy517Ala 


18 


KAAibF 


A 


T.V 


256 


Mz175T!ir 
Mz175Val 


19 


QAisH 


L 


S 


418 


IVIz337Ser 


Second-site mutations 


11 


QAiiS 


P 


L 


143 


AraC118Leu 


12 


IGGAiz 


T 


i.A 


274 


AraC249lle 
AraC249Ala 


13 


SWXUii3 


S,T 


L 


330 


AraC305Leu 


14 


l-AuY 


N 


S 


450 


AraC425Ser 


15 


GAisXGL 


Y, H, F, V 


c 


523 


AraC498Cys 
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Double mutation 


16 


TAieG 


L 


S 


428 


Mz347Ser- 
453Thr 


17 


YVAitG 


A.(S) 


T 


534 





Table 2 

Comparison of the Arabidopsis (SEQ ID NO:4) and maize (SEQ ID NO:8) protox-2 
amino acid sequences. Identical residues are denoted by the vertical bars between the two 
sequences. Alignment was performed using the GAP program described in Deveraux ef a/., 
Nucleic Acids Res. 72:387-395 (1984). Percent similarity: 75.889 / percent identity: 57.905. 

1 MASGAVAD.HQIEAVSGKRVAV 21 

- I h I • • : I • • — • I I 1 
1 MLAIiTASASSASSHPYRHASAHTRRPRLRAVLAMAGSDDPRAAPARSVAV 50 

22 VGAGVSGLAAAYKLKSRGIJWTVFEADGRVGGKLRSVMQNGLIWDEGANT 71 

Mlllllllllhh •|:|||||||.:MI|:|. :.h:||||||| 
51 VGAGVSGLAAAYRLRQSGVNVTVFEAADRAGGKIRTNSEGGFVWDEGANT 100 

72 MTEAEPEVGSLLDDLGLREKQQFPISQKKRYIVRNGVPVMLPTNPIELVT 121 

llhl l-.hlllll.:|ll:| ll-|||||::M.::|.:||.|:. 
101 MTEGEWEASRIiIDDLGLQDKQQYPNSQHKRYIVKDGAPALIPSDPISLMK 150 

122 SSVLSTQSKFQILLEPFLWKK. . . .KSSKVSDASAEESVSEFFQRHFGQE 167 

IIIIIMh-::||||:|| .hllh. :|IIM 

151 SSVLSTKSKIALFFEPFLYKKANTRNSGKVSEEHLSESVGSFCERHFGRE 200 

168 WDYLIDPFVGGTSAADPDSLSMKHSFPDLWNVEKSFGSIIVGAIRTKFA 217 

lilhUII|:||||:||:|||::M|.|||:i:.:||:|MII ^ht 
201 WDYFVDPFVAGTSAGDPESLSIRHAFPALWNLERKYGSVIVGAILSKLA 250 

218 AKGGKSRDTKSSPGTKKGSRGSFSFKGGMQILPDTLCKSLSHDEINLDSK 267 

III:. ..M.::-.|.ill|.llll I . • : : . | : : . | : . . 

251 AKGDPVKTRHDSSGKRRNRRVSFSFHGGMQSLINALHNEVGDDNVKLGTE 300 

268 VLSLS. .YlSrSGSRQENWSLSCVSHNETQRQ. . .NPHYDAVIMTAPLCNVK 312 

lllh :.|h|. I-.-::: I- Hlllllllhlh 

301 VLSLACTFDGVPALGRWSISVDSKDSGDKDLASNQTFDAVIMTAPLSNVR 350 

313 EMKVMKGGQPFQIJSrFLPEINYMPLSVLITTFTKEKVKRPLEGFG^ 362 

l|. MM. hll|.::|:|||:::|.M:.||:|lllllllll I 
351 RMKFTKGGAPVVLDFLPKm^YLPLSLMVTAFKKDDVKKPLEGFGVLIPYK 400 

363 E . QKHGFKTLGTLFSSMMFPDRSPSDVHLYTTFIGGSRNQELAKASTDEL 411 

I lllhllllllllllllllhM *llllhll|:|.:|| |.|. I 
401 EQQKHGLKTLGTLFSSMMFPDRAPDDQYLYTTFVGGSHNRDLAGAPTSIL 450 
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412 KQVVTSDLQRLLGVEGEPVSVNHYYWRKAFPLYDSSYDSVMEAIDKME^ 461 

Ihlllll-lllllhl. M II .llllh .Mhllhlll.: 

451 KQLVTSDLKKLLGVEGQPTFVKHVYWGNAFPLYGHDYSSVLEAIEKMEKN 500 

462 LPGFFYAGNHRGGLSVGKSIASGCKAADLVISYLESCSNDKKPNDSL* 509 

I II II I 111 ::|M|. Nihil Ml- II I II I : 

501 LPGFFYAGNSKDGLAVGSVIASGSKAADLAISYLESHTKHNNSH* . . . 545 



Table 3A 

Cross tolerance of plant protox mutants to various protox inhibitors. 



Formula 


AraC-1Val 


AraC-2Cvs 


AraC-lThr 


AraC-3Thr 


MzC-1Val 


XVII 


+ 


+ 


+ 


+ 


+ 


Vila 


+ 




+ 




+ 


IV 


++ 




++ 


++ 




XV 


+ 


+ 


+ 


+ 


+ 


XI 




+ 


+ 


++ 


+ 


XVI 










+ 


XII 


+ 




++ 


++ 


++ 


XIV 


+ 




+ 


+ 


+ 


*X 













+ = 10X or more tolerant than WT 
++ = 100X or more tolerant than WT 
- = no cross tolerance 

* = this compound was tested but provided no information 



Table 3B 

Cross tolerance of plant protox mutants to various protox inhibitors. 





AraC- 


AraC- 


AraC- 


AraC- 


AraC- 


AraC- 


AraC- 


AraC- 




1Leu 


2lle 


ILeu 


ILeu 


2]le 


2Cys 


2Leu 


2Met 








+ 


+ 


+ 


+ 


+ 


+ 








AraC- 


AraC- 


AraC 


AraC 


AraC 


AraC 


Formula 






2Met 


2Leu 


305Leu 


425Ser 


425Ser 


425Ser 


XVII 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


+ 


Vila 


++ 


++ 


++ 


++ 


++ 


++ 


++ 


++ 


IV 


++ 




+ 


++ 


+ 




+ 


+ 


XV 


++ 


+++ 


+++ 


+++ 


+++ 


++ 


+++ 


++ 


XI 


++ 


++ 


++ 


++ 


++ 


++ 


++ 




XVI 


+++ 


+++ 


+++ 


+++ 


+++ 


+ 


++ 




XII 


















XIV 


++ 


++ 


++ 


++ 


++ 




++ 


++ 
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Table 4 

Cross tolerance to various protox inhibitors in a seed gemrtination assay. 



Fomr^ula 


Common name 


Tolerance 


II 


acifluorofen 




III 


fomasafen 


+ 


IV 


fluoroglycofen 


t 


IVb 


bifenox 


+ 


IVc 


oxyfluorofen 


+ 


IVd 


actofen 


± 


Vila 


fluthiacet-methyl 


++ 


X 


sulfentrazone 


+ 


XI 


flupropazil 


++ 


XIV 


flumiclorac 


+ 


XVI 


flumioxazin 


+++ 


XVII 




++ 


XXIa 


BAY 11 340 


+ 


XXII 


++ 



± ^ 10X more tolerant than wt 
+ > 10X more tolerant than wt 
++ > 100X more tolerant than wt 
+++ > 1 0OOX more tolerant than wt 



Various modifications of the invention described herein will become apparent to 
those skilled in the art. Such modifications are intended to fall within the scope of the 
appended claims. 
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1 . An isolated DNA molecule comprising a coding sequence that encodes the amino 
acid sequence set forth in SEQ ID NO:45. 

2. The DNA molecule according to claim 1 , wherein said DNA molecule comprises the 
coding sequence set forth in SEQ ID NO:44. 

3. A chimeric gene comprising a promoter active in a plant operatively linlced to the 
DNA molecule according to claim 1. 

4. The chimeric gene according to claim 3, additionally comprising a signal sequence 
operatively linked to said DNA molecule, wherein said signal sequence targets the amino 
acid sequence encoded by said coding sequence to a plastid. 

5. The chimeric gene according to claim 3, additionally comprising a signal sequence 
operatively linked to said DNA molecule, wherein said signal sequence targets the amino 
acid sequence encoded by said coding sequence to a mitochondria. 

6. A recombinant vector comprising the chimeric gene according to claim 3. 

7. A host cell comprising the chimeric gene according to claim 3. 

8. The host cell according to claim 7, wherein said host cell is selected from the group 
consisting of a plant cell, a bacterial cell, a yeast cell, and an insect cell. 

9. The host cell according to claim 8, wherein said host cell is a plant cell. 

10. A chimeric gene comprising a promoter functional in a plant plastid operatively linked 
to the DNA molecule according to claim 1 . 

11. The chimeric gene according to claim 10, wherein said plant plastid promoter is a 
c/pP gene promoter. 
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12. The chimeric gene according to claim 10, further comprising a 5' untranslated 
sequence (5'UTR) from said plastid promoter and a plastid gene 3' untranslated sequence 
(3' UTR) operativeiy linked to said DMA molecule. 

13. The chimeric gene according to claim 12, wherein said plant plastid promoter is a 
c/pP gene promoter, and wherein said 3' UTR is a plastid /ps16 gene 3' untranslated 
sequence. 

1 4. A plastid transfomiation vector comprising the chimeric gene according to claim 1 2. 

15. A plant plastid comprising the plastid transformation vector according to claim 14. 

1 6. A plant comprising the plant cell according to claim 9. 

1 7. Seed from the plant according to claim 1 6. 

1 8. A plant cell comprising the plant plastid according to claim 1 5. 

19. A plant comprising the plant cell according to daim 18. 

20. Seed from the plant according to claim 1 9. 

21 . An isolated polypeptide comprising the amino acid sequence set forth in SEQ ID NO: 45, 
wherein said polypeptide has protoporphyrinogen oxidase (protox) activity. 

22. A method comprising: 

(a) obtaining a host cell comprising a DNA molecule according to claim 1 ; and 

(b) expressing a polypeptide comprising the amino acid sequence set forth in SEQ 
ID NO:45 in said host cell. 

23. The method according to claim 22, wherein said host cell Is a bacterial cell, a yeast cell 
or an insect cell. 

24. The method according to claim 23, wherein said host cell is E colh 
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25. A method for assaying a chemical for the ability to inhibit the activity of the polypeptide 
according to claim 21 comprising: 

(a) combining said polypeptide and protoporphyrinogen IX in a first reaction mixture 
under conditions in which said polypeptide catalyzes the conversion of said 
protoporphyrinogen IX to protoporphyrin IX; 

(b) combining a chemical, said polypeptide and protoporphyrinogen IX in a second 
reaction mixture under the conditions of step (a); 

(c) measuring the fluorescence emission of said first and said second reaction 
mixtures at about 622 to about 635 nM after excitation of said first and said second reaction 
mixtures at about 395 to about 410 nM; 

(d) comparing the fluorescence emission of said first and said second reaction 
mixtures at about 622 to about 635 nM; 

(e) selecting a chemical that inhibits the activity of said polypeptide If the 
fluorescence emission of said second reaction mixture is less than the fluorescence 
emission of said first reaction mixture and if the decrease In fluorescence emission 
between said first and second reaction mixtures is larger than the margin of error inherent in 
the measurement of said fluorescence emission. 
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SEQUENCE LISTING 
<110> Syngenta Participations AG 

<120> PROTOPORPHYRINOGEN OXIDASE ( "PROTOX* ) GENES 

<130> PB/5-31284A 

<140> 
<141> 

<150> US 09/525,152 ! 
<151> 2000-03-14 i 

<160> 47 

<170> Patentin Ver. 2.1 

<210> 1 
<211> 1719 
<212> DNA 

<213> Arabidopsis thaliana 

<220> 

<221> CDS 

<222> (31) . . (1644) 

<223> Arabidopsis protox-1 

<400> 1 

tgacaaaatt ccgaattctc tgcgatttcc atg gag tta tct ctt etc cgt ccg 54 

Met Glu Leu Ser Leu Leu Arg Pro 
1 5 

acg act caa teg ctt ctt ccg teg ttt teg aag ecc aat etc cga tta 102 
Thr Thr Gin Ser Leu Leu Pro Ser Phe Ser Lys Pro Asn Leu Arg Leu 
10 15 20 

aat gtt tat aag cct ctt aga etc cgt tgt tea gtg gcc ggt gga cea 150 
Asn Val Tyr Lys Pro Leu Arg Leu Arg Cys Ser Val Ala Gly Gly Pro 
25 30 35 40 

ace gtc gga tct tea aaa ate gaa ggc gga gga gge acc ace ate acg 198 
Thr Val Gly Ser Ser Lys lie Glu Gly Gly Gly Gly Thr Thr lie Thr 
45 50 55 

acg gat tgt gtg att gtc ggc gga ggt att agt ggt ctt tgc ate get 246 
Thr Asp Cys Val lie Val Gly Gly Gly lie Ser Gly Leu Cys lie Ala 
60 65 70 

cag gcg ctt get act aag cat cct gat get get ccg aat tta att gtg 294 
Gin Ala Leu Ala Thr Lys His Pro Asp Ala Ala Pro Asn Leu lie Val 
75 80 85 

acc gag get aag gat cgt gtt gga ggc aac att ate act cgt gaa gag 342 
Thr Glu Ala Lys Asp Arg Val Gly Gly Asn lie lie Thr Arg Glu Glu 
90 95 100 
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aat ggt ttt etc tgg gaa gaa ggt ccc aat agt ttt caa ccg tct gat 390 
Asn Gly Phe Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
105 110 115 120 

cct atg etc act atg gtg gta gat agt ggt ttg aag gat gat ttg gtg 438 
Pro Met Leu Thr Met Val Val Asp Ser Gly Leu Lys Asp T^p Leu Val 
125 130 135 

ttg gga gat cct act geg cea agg ttt gtg ttg tgg aat ggg aaa ttg 486 
Leu Gly Asp Pro Thr Ala Pro Arg Phe Val Leu Trp Asn Gly Lys Leu 
140 145 150 

agg ccg gtt cca teg aag eta aca gac tta ccg ttc ttt gat ttg atg 534 
Arg Pro Val Pro Ser Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met 
155 160 165 

agt att ggt ggg aag att aga get ggt ttt ggt gea ctt ggc att ega 582 
Ser He Gly Gly Lys He Arg Ala Gly Phe Gly Ala Leu Gly He Arg 
170 175 180 

ccg tea cct cca ggt cgt gaa gaa tct gtg- gag gag ttt gta egg cgt 630 
Pro Ser Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 
185 190 195 200 

aac etc ggt gat gag gtt ttt gag cgc etg att gaa ccg ttt tgt tea 678 
Asn Leu Gly Asp Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser 
205 210 215 

ggt gtt tat get ggt gat cct tea aaa etg age atg aaa gea geg ttt 726 
Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe 
220 225 230 

ggg aag gtt tgg aaa eta gag caa aat ggt gga age ata ata ggt ggt 774 
Gly Lys Val Trp Lys Leu Glu Gin Asn Gly Gly Ser He He Gly Gly 

235 240 245 

act ttt aag gea att eag gag agg aaa aac get ccc aag gea gaa ega 822 
Thr Phe Lys Ala He Gin Glu Arg Lys Asn Ala Pro Lys Ala Glu Arg 
250 255 260 

gac ccg cgc etg cca aaa cca eag ggc caa aca gtt ggt tct ttc agg 870 
Asp Pro Arg Leu Pro Lys Pro Gin Gly Gin Thr Val Gly Ser Phe Arg 
265 270 275 280 

aag gga ctt ega atg ttg cca gaa gea ata tct gea aga tta ggt age 918 
Lys Gly Leu Arg Met Leu Pro Glu Ala He Ser Ala Arg Leu Gly Ser 
285 290 295 

aaa gtt aag ttg tct tgg aag etc tea ggt ate act aag etg gag age 966 
Lys Val Lys Leu Ser Trp Lys Leu Ser Gly He Thr Lys Leu Glu Ser 
300 305 310 

gga gga tac aac tta aca tat gag act cca gat ggt tta gtt tec gtg 1014 
Gly Gly Tyr Asn Leu Thr Tyr Glu Thr Pro Asp Gly Leu Val Ser Val 
315 320 325 
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cag age aaa agt gtt gta atg acg gtg cca tct cat gtt gca agt ggt 1062 
Gin Ser Lys Ser Val Val Met Thr Val Pro Ser His Val Ala Ser Gly 
330 335 340 

etc ttg cgc cct ctt tct gaa tct get gea aat gca etc tea aaa eta 1110 
Leu Leu Arg Pro Leu Ser Glu Ser Ala Ala Asn Ala Leu Ser Lys Leu 
345 350 355 360 

tat tac cca cca gtt gca gca gta tct ate teg tac ccg aaa gaa gca 1158 
Tyr Tyr Pro Pro Val Ala Ala Val Ser He Ser Tyr Pro Lys Glu Ala 
365 370 375 

ate ega aca gaa tgt ttg ata gat ggt gaa eta aag ggt ttt ggg caa 1206 
He Arg Thr Glu Cys Leu He Asp Gly Glu Leu Lys Gly Phe Gly Gin 
380 385 390 

ttg cat cca cgc acg caa gga gtt gaa aca tta gga act ate tac age 1254 
Leu His Pro Arg Thr Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 
395 400 405 

tec tea etc ttt cca aat cgc gca ccg ccc gga aga att ttg ctg ttg 1302 
Ser Ser Leu Phe Pro Asn Arg Ala Pro Pro Gly Arg He Leu Leu Leu 
410 415 420 

aac tac att ggc ggg tct aca aac ace gga att ctg tec aag tct gaa 1350 
Asn Tyr He Gly Gly Ser Thr Asn Thr Gly He Leu Ser Lys Ser Glu 
425 430 435 440 

ggt gag tta gtg gaa gca gtt gac aga gat ttg agg aaa atg eta att 1398 
Gly Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu He 
445 450 455 

aag cct aat teg ace gat cca ctt aaa tta gga gtt agg gta tgg cct 1446 
Lys Pro Asn Ser Thr Asp Pro Leu Lys Leu Gly Val Arg Val Trp Pro 
460 465 470 

caa gcc att cct cag ttt eta gtt ggt cac ttt gat ate ctt gac acg 1494 
Gin Ala He Pro Gin Phe Leu Val Gly His Phe Asp He Leu Asp Thr 
475 480 485 

get aaa tea tct eta acg tct teg ggc tac gaa ggg eta ttt ttg ggt 1542 
Ala Lys Ser Ser Leu Thr Ser Ser Gly Tyr Glu Gly Leu Phe Leu Gly 
490 495 500 

ggc aat tac gtc get ggt gta gee tta ggc egg tgt gta gaa ggc gca 1590 
Gly Ash Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala 
505 510 515 520 

tat gaa ace gcg att gag gtc aac aac ttc atg tea egg tac get tac 1638 
Tyr Glu Thr Ala He Glu Val Asn Asn Phe Met Ser Arg Tyr Ala Tyr 
525 530 535 

aag taa atgtaaaaca ttaaatetec eagcttgegt gagttttatt aaatattttg 1694 
Lys 

agatatccaa aaaaaaaaaa aaaaa 1719 
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<210> 2 

<211> 537 
<212> PRT 

<213> Arabidopsis tha liana 



<400> 2 














Met 


Glu 


Leu 


Ser 


Leu 


Leu 


Ara Pro Thr Thr 


Gin Ser Leti T^pu l£>yc% Qor* 


1 








5 




1 0 

Xv 


XD 


Phe 


Ser 


Lys 


Pro 


Asn 


Leu 


Aro Li^u AsTi Val 


Tvir TiVS PT*rt Ti0ii A>"rT T^ii 
MJJfSf C^*,KJ JJt?U AX^g XRSU 








20 






25 




Ara 


Cvs 


Ser 


Val 


Ala 


Glv 


Glv Pro Thr Val 


Glv Ser Ser TiV^ T1» Glu 






35 








40 




Glv 


Gly 




Glv 


XilX 


XllX 


XXC lllJL ixix 


v*y5 vax xxe vax oJ.y vaxy 




50 












DU 


Glv 


Tip 




Glv 


T .oil 


cys 


Tie Ala Ilia 

xxe AX a V7X11 Axa 


jueu Axa inr jjys nxs pro 












/ u 




Q on 


Asp 


Ala 




Pro 


Asn 


Leu 


xie vax inr Giu 


Ala Lys Asp Arg Val Gly 










Q ^ 




on 


95 


c^^ \r 
vjxy 


ASXl 




X j.e 


ilJlX 


Arg 


v^xu vjxu Asn vjxy 


Phe Leu Trp Glu Glu Gly 








XVJ u 






xUo 


110 


PlTO 


Ash 


Ser 






Pro 


£>ex Asp irro new 


Lieu xnr nez vax vax Asp 






115 








X A V 


IOC 




Glv 


Leu 


Lys 


AkSp 


Asp 


jjqu vax JLieu v^xy 


Asp Jrro inr Axa Fro Arg 
















An 
X4U 


Phe 


Val 


Li6U 


Trp 


Asn 


laxy 


jjys ijeu Axg irro 


Val Pro Ser Lys Leu Thr 


145 










XD\J 




XD3 160 






IT J. I_l 




irllvs 


Asp 


jlj€u neu oex xxe 


v»xy vaxy jjys xxe Arg AXa 










ICR 




1 /O 


175 


Gly 


Phe 




AX a 


Leu 




xxe Arg Jrro ber 


Pro Pro Gly Arg Glu Glu 








1 fin 
xau 






X 09 


1 on 
X^U 


Ser 


Val 


Glu 


Glu 


Phe 


Val 


Atct Attt Aqti 


Glv A en Gin Val PVia flT ii 
V9xy Aaju V9XU VaX irxie v^XU 






195 








200 


<b Z7 


Arg 


Leu 


lie 


Glu 


Pro 


Phe 


Cvs Ser Glv Val 


'Pv>* Ala f5l \7 AQTi Pr'rk 

•L y J- AJLCl V3Xy C HJ OCX 




210 










215 


220 


Lys 


Leu 


Ser 


Met 


Lys 


Ala 


Ala Phe Glv Lvs 


Val Tm TjVQ TiP^u Gin r» 
vox xjL^ xjy 0 ucu uxu uixix 


225 










230 






Asn 


Glv 


Gly 


Ser 


He 


He 


Gly Gly Thr Phe 


Lvs Ala Tlt> Gin Glii Aytt 










245 








Lys 


Asn 


Ala 


Pro 


Lys 


Ala 


Glu Arcf Aqo Pm 


AtTT TiOlt PY"^ Ta/G P-^rt n 








260 






265 




Gly 


Gin 


Thr 


Val 


Gly 


Ser 


Phe Arg Lys Gly 


Leu Arg Met Leu Pro Glu 






275 








280 


285 


Ala 


lie 


Ser 


Ala 


Arg 


Leu 


Gly Ser Lys Val 


Lys Leu Ser Trp Lys Leu 




290 










295 


300 


Ser 


Gly 


He 


Thr 


Lys 


Leu 


Glu Ser Gly Gly 


Tyr Asn Leu Thr Tyr Glu 


305 










310 




315 320 


Thr 


Pro 


Asp 


Gly 


Leu 


Val 


Ser Val Gin Ser 


Lys Ser Val Val Met Thr 










325 




330 


335 


Val 


Pro 


Ser 


His 


Val 


Ala 


Ser Gly Leu Leu 


Arg Pro Leu Ser Glu Ser 








340 






345 


350 


Ala 


Ala 


Asn 


Ala 


Leu 


Ser 


Lys Leu Tyr Ty^^ 


Pro Pro Val Ala Ala Val 






355 








360 


365 


Ser 


He 


Ser 


Tyr 


Pro 


Lys 


Glu Ala He Arg 


Thr Glu Cys Leu He Asp 




370 










375 


380 


Gly 


Glu 


Leu 


Lys 


Gly 


Phe 


Gly Gin Leu His 


Pro Arg Thr Gin Gly Val 


385 










390 




395 400 
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Glu 


Thr 


Leu Gly 


Thr lie Tyr 


Ser 


Ser 


Ser 

Al ft 


Leu Phe 


Pro 


Asn 


Arg 


Ala 


Pro 


Pro 


Gly Arg 


lie Leu Leu 


Leu 


Asn 


Tyr 


He Gly 


Gly 


Ser 

yi "a ft 


Thr 


Asn 


Thr 


Gly 


lie Leu 


Ser Lys Ser 


Glu 


Gly 


Glu 


Leu Val 


Glu 


Ala 


Val 


Asp 










AA ft 








A AC 






Arg 


Asp 


Leu Arg 


Lys Met Leu 


lie 


Lys 


Pro 


Asn Ser 
460 


Thr 


Asp 


Pro 


Leu 


Lys 


Leu 


Gly Val 


Arg Val Trp 


Pro 


Gin 


Ala 


He Pro 


Gin 


Phe 


Leu 


Val 


465 






470 








475 








480 


Gly 


His 


Phe Asp 


He Leu Asp 
485 


Thr 


Ala 


Lys 
490 


Ser Ser 


Leu 


Thr 


Ser 
495 


Ser 


Gly 


Tyr 


Glu Gly 
500 


Leu Phe Leu 


Gly 


Gly 
505 


Asn 


Tyr Val 


Ala 


Gly 
510 


Val 


Ala 


Leu 


Gly 


Arg Cys 
515 


Val Glu Gly 


Ala 
520 


Tyr 


Glu 


Thr Ala 


He 
525 


Glu 


Val 


Asn 


Asn 


Phe 
530 


Met Ser 


Arg Tyr Ala 
535 


Tyr 


Lys 















<210> 3 
<211> 1738 
<212> DNA 

<213> Arabidopsis thalieina 

<220> 

<221> CDS 

<222> (70) . . (1596) 

<223> Arabidopsis protox-2 

<400> 3 

ttttttactt atttccgtca ctgctttcga ctggtcagag attttgactc tgaattgttg 60 

cagatagca atg gcg tct gga gca gta gca gat cat caa att gaa gcg gtt 111 
Met Ala Ser Gly Ala Val Ala Asp His Gin He Glu Ala Val 
15 10 

tea gga aaa aga gtc gca gtc gta ggt gca ggt gta agt gga ctt gcg 159 
Ser Gly Lys Arg Val Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala 
15 20 25 30 

gcg get tac aag ttg aaa teg agg ggt ttg aat gtg act gtg ttt gaa 207 
Ala Ala Tyr Lys Leu Lys Ser Arg Gly Leu Asn Val Thr Val Phe Glu 
35 40 45 

get gat gga aga gta ggt ggg aag ttg aga agt gtt atg caa aat ggt 255 
Ala Asp Gly Arg Val Gly Gly Lys Leu Arg Ser Val Met Gin Asn Gly 
50 55 60 

ttg att tgg gat gaa gga gca aac aec atg act gag get gag cea gaa 303 
Leu He Trp Asp Glu Gly Ala Asn Thr Met Thr Glu Ala Glu Pro Glu 
65 70 75 

gtt ggg agt tta ctt gat gat ctt ggg ctt cgt gag aaa caa caa ttt 351 
Val Gly Ser Leu Leu Asp Asp Leu Gly Leu Arg Glu Lys Gin Gin Phe 
80 85 90 
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cca att tea cag aaa aag egg tat att gtg egg aat ggt gta cct gtg 399 
Pro He Ser Gin Lys Lys Arg Tyr He Val Arg Asn Gly Val Pro Val 
95 100 105 110 

atg eta eet ace aat ecc ata gag ctg gtc aea agt agt gtg etc tct 447 
Met Leu Pro Thr Asn Pro He Glu Leu Val Thr Ser Ser Val Leu Ser 
115 120 125 

ace caa tct aag ttt eaa ate ttg ttg gaa cca ttt tta tgg aag aaa 495 
Thr Gin Ser Lys Phe Gin He Leu Leu Glu Pro Phe Leu Trp Lys Lys 
130 135 140 

aag tee tea aaa gtc tea gat gea tct get gaa gaa agt gta age gag 543 
Lys Ser Ser Lys Val Ser Asp Ala Ser Ala Glu Glu Ser Val Ser Glu 
145 150 155 

ttc ttt eaa egc cat ttt gga caa gag gtt gtt gae tat etc ate gae 591 
Phe Phe Gin Arg His Phe Gly Gin Glu Val Val Asp Tyr Leu He Asp 
160 165 170 

cct ttt gtt ggt gga aea agt get gcg gae cct gat tee ett tea atg 639 
Pro Phe Val Gly Gly Thr Ser Ala Ala Asp Pro Asp Ser Leu Ser Met 
175 180 185 190 

aag cat tct ttc cca gat etc tgg aat gta gag aaa agt ttt ggc tct 687 
Lys His Ser Phe Pro Asp Leu Trp Asn Val Glu Lys Ser Phe Gly Ser 
195 200 205 

att ata gtc ggt gca ate aga aea aag ttt get get aaa ggt ggt aaa 735 
He He Val Gly Ala He Arg Thr Lys Phe Ala Ala Lys Gly Gly Lys 
210 215 220 

agt aga gae aea aag agt tct cct ggc aea aaa aag ggt teg cgt ggg 783 
Ser Arg Asp Thr Lys Ser Ser Pro Gly Thr Lys Lys Gly Ser Arg Gly 
. 225 230 235 

tea ttc tct ttt aag ggg gga atg cag att ett cct gat aeg ttg tge 831 
Ser Phe Ser Phe Lys Gly Gly Met Gin He Leu Pro Asp Thr Leu Cys 
240 245 250 

aaa agt etc tea cat gat gag ate aat tta gae tec aag gta etc tct 879 
Lys Ser Leu Ser His Asp Glu He Asn Leu Asp Ser Lys Val Leu Ser 
255 260 265 270 

ttg tct tac aat tct gga tea aga cag gag aac tgg tea tta tct tgt 927 
Leu Ser Tyr Asn Ser Gly Ser Arg Gin Glu Asn Trp Ser Leu Ser Cys 
275 280 285 

gtt teg cat aat gaa aeg cag aga caa aac ecc cat tat gat get gta 975 
Val Ser His Asn Glu Thr Gin Arg Gin Asn Pro His Tyr Asp Ala Val 
290 295 300 

att atg aeg get cct ctg tge aat gtg aag gag atg aag gtt atg aaa 1023 
He Met Thr Ala Pro Leu Cys Asn Val Lys Glu Met Lys Val Met Lys 
305 310 315 
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gga gga caa ccc ttt cag eta aac ttt etc ccc gag att aat tac atg 1071 
Gly Gly Gin Pro Phe Gin Leu Asn Phe Leu Pro Glu lie Asn Tyr Met 
320 325 330 

ccc etc teg gtt tta ate aee aca tte aea aag gag aaa gta aag aga 1119 
Pro Leu Ser Val Leu lie Thr Thr Phe Thr Lys Glu Lys Val Lys Arg 
335 340 345 350 

eet ctt gaa ggc ttt ggg gta etc att eea tet aag gag eaa aag eat 1167 
Pro Leu Glu Gly Phe Gly Val Leu lie Pro Ser Lys Glu Gin Lys His 
355 360 365 

ggt tte aaa act eta ggt aea ett ttt tea tea atg atg ttt eea gat 1215 
Gly Phe Lys Thr Leu Gly Thr Leu Phe Ser Ser Met Met Phe Pro Asp 
370 375 380 

cgt tee ect agt gae gtt eat eta tat aca act ttt att ggt ggg agt 1263 
Arg Ser Pro Ser Asp Val His Leu Tyr Thr Thr Phe lie Gly Gly Ser 
385 390 395 

agg aac cag gaa eta gee aaa get tec act gae gaa tta aaa eaa gtt 1311 
Arg Asn Gin Glu Leu Ala Lys Ala Ser Thr Asp Glu Leu Lys Gin Val 
400 405 410 

gtg act tet gae ctt cag cga etg ttg ggg gtt gaa ggt gaa ccc gtg 1359 
Val Thr Ser Asp Leu Gin Arg Leu Leu Gly Val Glu Gly Glu Pro Val 
415 420 425 430 

tet gte aac cat tac tat tgg agg aaa gca tte ccg ttg tat gae age 1407 
Ser Val Asn His Tyr Tyr Trp Arg Lys Ala Phe Pro Leu Tyr Asp Ser 
435 440 445 

age tat gae tea gte atg gaa gca att gae aag atg gag aat gat eta 1455 
Ser Tyr Asp Ser Val Met Glu Ala lie Asp Lys Met Glu Asn Asp Leu 
450 455 460 

ect ggg tte tte tat gca ggt aat cat cga ggg ggg etc tet gtt ggg 1503 
Pro Gly Phe Phe Tyr Ala Gly Asn His Arg Gly Gly Leu Ser Val Gly 
465 470 475 

aaa tea ata gea tea ggt tge aaa gea get gae ett gtg ate tea tae 1551 
Lys Ser He Ala Ser Gly Cys Lys Ala Ala Asp Leu Val He Ser Tyr 
480 485 490 

ctg gag tet tge tea aat gae aag aaa eea aat gae age tta taa 1596 
Leu Glu Ser Cys Ser Asn Asp Lys Lys Pro Asn Asp Ser Leu 
495 500 505 

cattgtcaag gttcgteeet ttttateaet taetttgtaa acttgtaaaa tgeaacaage 1656 
egecgtgcga ttagccaaca actcagcaaa aeceagattc teataagget cactaattce 1716 
agaataaaet atttatgtaa aa 1738 



<210> 4 
<211> 508 
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<212> PRT 

<213> Arabidopsis tha liana 



<400> 4 
















nec 


AJLa 


ber 


\»iy 


Aia 


Val 


Ala Asp His Gin 


lie Glu Ala vai ser Gly 


X 














10 


15 


Lys 


Arg 


VclX 


AXa 


vai 


vax 


Gly Ala Gly Val 


Ser Gly Leu Ala Ala Ala 
















25 


^ A 

oO 


Tyx 


Lys 


Leu 


Lys 


Ser 


Arg 


Gly Leu Asn Val 


Thr Val Phe Glu Ala Asp 
















40 


45 




Arg 


vax 


Giy 


Gly 


Lys 


Leu Arg Ser Val 


Met Gin Asn Gly Leu He 














55 




C A 

bU 


Trp 


Asp 


Glu 


Gly 


Ala 


Asn 


Thr 


Met Thr Glu 


Ala Glu Pro Glu Val Gly 


65 










70 






75 80 


Ser 


Leu 


Leu 


Asp 


Asp 


Leu 


Gly Leu Arg Glu 


Lys Gin Gin Phe Pro He 










85 






90 


95 


S&T 


GJLn 


Lys 


Lys 


Arg 




lie 


Val Arg Asn 


Gly Val Pro Val Met Leu 








1 AA 
lUU 








105 


110 


Pro 


Thr 


Asn 


Pro 


lie 


Glu 


Leu 


Val Thr Ser 


Ser Val Leu Ser Thr Gin 






115 










120 


125 




Lys 


Phe 


Gin 


lie 


Leu 


Leu 


Glu Pro Phe 


Leu Trp Lys Lys Lys Ser 




130 










135 




t il A 

140 


Ser 


Lys 


vai 


Ser 


Asp 


Ala 


Ser 


Ala Glu Glu 


Ser Val Ser Glu Phe Phe 


145 










150 






155 160 


Gin 


Arg 


His 


Phe 


Gly 


Gin 


Glu 


Val Val Asp 


Tyr Leu He Asp Pro Phe 










lob 






170 


175 


TT-s T 

vaJ. 


Gly 


Gly 


Tnr 


Ser 


Ala 


Ala Asp Pro Asp 


Ser Leu Ser Met Lys His 








1 O A 

loU 








185 


190 


Ser 


Pile 


Pro 


Asp 


Leu 


Trp 


Asn Val Glu Lys 


Ser Phe Gly Ser He He 






1^3 










200 


205 


vai 


Gly 


Ala 


lie 


Arg 


Thr 


Lys 


Phe Ala Ala 


Lys Gly Gly Lys Ser Arg 














215 




220 


Asp 


Thr 


Lys 


SGT 


Ser 


Pro 


Gly Thr Lys Lys 


Gly Ser Arg Gly Ser Phe 


















235 240 


3er 




Lys 


oiy 


Giy 


nec 


Gin 


He Leu Pro 


Asp Thr Leu Cys Lys Ser 










4%D 






250 


o c c 
z55 


Leu 


Ser 


nxs 


Asp 


VaXU 


xxe 


Asn Leu Asp Ser 


Lys Val Leu Ser Leu Ser 








Z D tl 








265 


OTA 
Z f\) 


Tyr 


Asn 


Ser 


k»iy 


Ser 


Arg 


Gin 


Glu Asn Trp 


Ser Leu Ser Cys Val Ser 






Z / 3 










280 


IOC 


ms 


Asn 


ValU 


inr 


Gin 


Arg 


Gin 


Asn Pro His 


Tyr Asp Ala Val He Met 














295 




inn 


Thr 


Ala 


Pro 


Leu 


Cys 


Asn 


Val 


Lys Glu Met 


iiys vai £iec Lys Giy Gly 


305 










310 






JXJ OZU 


Gin 


Pro 


Phe 


Gin 


Leu 


Asn 


Phe 


Leu Pro Glu 


He Asn Tyr Met Pro Leu 










325 






330 


335 


Ser 


Val 


Leu 


lie 


Thr 


Thr 


Phe 


Thr Lys Glu 


Lys Val Lys Arg Pro Leu 








340 








345 


350 


Glu 


Gly 


Phe 


Gly 


Val 


Leu 


He 


Pro Ser Lys 


Glu Gin Lys His Gly Phe 






355 










360 


365 


Lys 


Thr 


Leu 


Gly 


Thr 


Leu 


Phe 


Ser Ser Met 


Met Phe Pro Asp Arg Ser 




370 










375 




380 


Pro 


Ser 


Asp 


Val 


His 


Leu 


Tyr 


Thr Thr Phe 


He Gly Gly Ser Arg Asn 


385 










390 






395 400 


Gin 


Glu 


Leu 


Ala 


Lys 


Ala 


Ser Thr Asp Glu 


Leu Lys Gin Val Val Thr 










405 






410 


415 


Ser 


Asp 


Leu 


Gin 


Arg 


Leu 


Leu Gly Val Glu 


Gly Glu Pro Val Ser Val 
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420 




425 








430 






Asn 


His 


Tyr 


Tyr 


Trp Arg Lys 


Ala Phe 


Pro 


Leu Tyr 


Asp Ser 


Ser 


Tyr 






435 






440 








445 






Asp 


Ser 


Val 


Met 


Glu Ala He 


Asp Lys 


Met 


Glu 


Asn 


Asp Leu 


Pro 


Gly 




450 






455 








460 






Phe 


Phe 


Tyr 


Ala 


Gly Asn His 


Arg Gly 


Gly 


Leu 


Ser 


Val Gly 


Lys 


Ser 


465 








470 






475 




480 


He 


Ala 


Ser 


Gly 


Cys Lys Ala 


Ala Asp 


Leu 


Val 


He 


Ser Tyr 


Leu 


Glu 










485 




490 








495 




Ser Cys 


Ser 


Asn 


Asp Lys Lys 


Pro Asn 


Asp 


Ser 


Leu 














500 




505 















<210> 5 
<211> 1691 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1) ..(1443) 

<223> maize protox-1 c-DNA(not full-length) 
<400> 5 

gcg gac tgc gtc gtg gtg ggc gga ggc ate agt ggc etc tge acc gcg 48 
Ala Asp Cys Val Val Val Gly Gly Gly He Ser Gly Leu Cys Thr Ala 
15 10 15 

cag gcg ctg gcc acg egg cac ggc gtc ggg gac gtg ctt gtc acg gag 96 
Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val Thr Glu 
20 25 30 

gcc cgc gcc cgc ccc ggc ggc aac att acc acc gtc gag cgc ccc gag 144 
Ala Arg Ala Arg Pro Gly Gly Asn He Thr Thr Val Glu Arg Pro Glu 
35 40 45 

gaa ggg tac etc tgg gag gag ggt ccc aac age ttc cag ccc tec gac 192 
Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
50 55 60 

ccc gtt etc acc atg gcc gtg gac age gga ctg aag gat gac ttg gtt 240 
Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val 
65 70 75 80 

ttt ggg gac cea aac gcg ccg cgt ttc gtg ctg tgg gag ggg aag ctg 288 
Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu 
85 90 95 

agg ccc gtg cea tec aag ccc gee gac etc ccg ttc ttc gat etc atg 336 
Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp Leu Met 
100 105 110 

age ate cea ggg aag etc agg gcc ggt eta ggc gcg ctt ggc ate cgc 384 
Ser He Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly He Arg 
115 120 125 
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ccg cct cct cca ggc cgc gaa gag tea gtg gag gag ttc gtg cgc cgc 432 
Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 
130 135 140 

aac etc ggt get gag gtc ttt gag cgc etc att gag cct ttc tgc tea 480 
Asn Leu Gly Ala Glu Val Phe Glu Arg Leu lie Glu Pro Phe Cys Ser 
145 150 155 160 

ggt gtc tat get ggt gat cct tct aag etc age atg aag get gca ttt 528 
Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe 
165 170 175 

ggg aag gtt tgg egg ttg gaa gaa act gga ggt agt att att ggt gga 576 
Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser lie lie Gly Gly 
180 185 190 

acc ate aag aea att cag gag agg age aag aat cca aaa cca ccg agg 624 
Thr lie Lys Thr lie Gin Glu Arg Ser Lys Asn Pro Lys Pro Pro Arg 
195 200 205 

gat gee cgc ctt ccg aag cca aaa ggg cag aea gtt gca tct ttc agg 672 
Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Ala Ser Phe Arg 
210 215 220 

aag ggt ctt gcc atg ctt cca aat gcc att aca tec age ttg ggt agt 720 
Lys Gly Leu Ala Met Leu Pro Asn Ala He Thr Ser Ser Leu Gly Ser 
225 230 235 240 

aaa gtc aaa eta tea tgg aaa etc acg age att aca aaa tea gat gac 768 
Lys Val Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys Ser Asp Asp 
245 250 255 

aag gga tat gtt ttg gag tat gaa acg cca gaa ggg gtt gtt teg gtg * 816 
Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val Val Ser Val 
260 265 270 

cag get aaa agt gtt ate atg act att cca tea tat gtt get age aac' 864 
Gin Ala Lys Ser Val He Met Thr He Pro Ser Tyx Val Ala Ser Asn 
275 280 285 

att ttg cgt cca ctt tea age gat get gca gat get eta tea aga ttc 912 
He Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser Arg Phe 
290 295 300 

tat tat cca ccg gtt get get gta act gtt teg tat cca aag gaa gca 960 
Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala 
305 310 315 320 

att aga aaa gaa tgc tta att gat ggg gaa etc cag ggc ttt gge cag 1008 
He Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly Phe Gly Gin 
325 330 335 

ttg eat cca cgt agt caa gga gtt gag aea tta gga aca ata tae agt 1056 
Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 
340 345 350 

tec tea etc ttt cca aat cgt get cct gac ggt agg gtg tta ctt eta 1104 
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Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val Leu Leu Leu 
355 360 365 

aac tac ata gga ggt get aca aac aca gga att gtt tec aag act gaa 1152 
Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Val Ser Lys Thr Glu 
370 375 380 

agt gag ctg gtc gaa gca gtt gac cgt gac etc ega aaa atg ctt ata 1200 
Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met Leu He 
385 390 395 400 

aat tct aca gca gtg gac cct tta gtc ctt ggt gtt ega gtt tgg cca 1248 
Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg Val Trp Pro 
405 410 415 

caa gee ata cct cag ttc ctg gta gga cat ctt gat ctt ctg gaa gcc 1296 
Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Glu Ala 
420 425 430 

gca aaa get gee ctg gac ega ggt ggc tac gat ggg ctg ttc eta gga 1344 
Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu Phe Leu Gly 
435 440 445 

ggg aac tat gtt gca gga gtt gcc ctg ggc aga tgc gtt gag ggc gcg 1392 
Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala 
450 455 460 

tat gaa agt gcc teg. caa ata tct gac ttc ttg ace aag tat gcc tac 1440 
Tyr Glu Ser Ala Ser Gin He Ser Asp Phe Leu Thr Lys Tyr Ala Tyr 
465 470 475 480 

aag tgatgaaaga agtggagcgc tacttgttaa tcgtttatgt tgcatagatg 1493 
Lys 

aggtgcctcc ggggaaaaaa aagcttgaat agtatttttt attcttattt tgtaaattgc 1553 
atttctgttc ttttttctat cagtaattag ttatatttta gttctgtagg agattgttct 1613 
gttcactgcc cttcaaaaga aattttattt ttcattcttt tatgagagct gtgctactta 1673 
aaaaaaaaaa aaaaaaaa 1691 



<210> 6 
<211> 481 
<212> PRT 
<213> Zea mays 

<400> 6 

Ala Asp Cys Val Val Val Gly Gly Gly He Ser Gly Leu Cys Thr Ala 
15 10 15 

Gin Ala Leu Ala Thr Arg His Gly Val Gly Asp Val Leu Val Thr Glu 
20 25 30 

Ala Arg Ala Arg Pro Gly Gly Asn He Thr Thr Val Glu Arg Pro Glu 
35 40 45 
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Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
50 55 60 

Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val 
65 70 75 80 

Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu 
85 90 95 

Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe Asp Leu Met 
100 105 110 

Ser lie Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly He Arg 
115 120 125 

Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 
130 135 140 

Asn Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser 
145 150 155 160 

Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe 
165 170 175 

Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser He He Gly Gly 
180 185 190 

Thr He Lys Thr He Gin Glu Arg Ser Lys Asn Pro Lys Pro Pro Arg 
195 200 205 

Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Ala Ser Phe Arg 
210 215 220 

Lys Gly Leu Ala Met Leu Pro Asn Ala He Thr Ser Ser Leu Gly Ser 
225 230 235 240 

Lys Val Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys Ser Asp Asp 
245 250 255 

Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val Val Ser Val 
260 265 270 

Gin Ala Lys Ser Val He Met Thr He Pro Ser Tyr Val Ala Ser Asn 
275 280 285 

He Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu Ser Arg Phe 
290 295 300 

Tyr Tyr Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro Lys Glu Ala 
305 310 315 320 

He Arg Lys Glu Cys Leu He Asp Gly Glu Leu Gin Gly Phe Gly Gin 
325 330 335 



Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser 
340 345 350 
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Ser Ser Leu Phe Pro Asn Arg 
355 

Asn Tyr lie Gly Gly Ala Thr 
370 375 

Ser Glu Leu Val Glu Ala Val 
385 390 

Asn Ser Thr Ala Val Asp Pro 
405 

Gin Ala lie Pro Gin Phe Leu 
420 

Ala Lys Ala Ala Leu Asp Arg 
435 

Gly Asn Tyr Val Ala Gly Val 
450 455 

Tyx Glu Ser Ala Ser Gin lie 
465 * 470 

Lys 



<210> 7 

<211> 2061 

<212> DNA 

<213> Zea mays 

<220> 
<221> CDS 
<222> (64) (1698) 
<223> maize protox-2 

<400> 7 

ctctcctacc tccacctcca cgacaacaag caaatcccca tccagttcca aaccctaact 60 

caa atg etc get ttg act gcc tea gcc tea tec get teg tec cat cct 108 
Met Leu Ala Leu Thr Ala Ser Ala Ser Ser Ala Ser Ser His Pro 

1 5 10 * 15 . 

tat cgc cac gcc tec geg cac act cgt egc ccc cgc eta cgt gcg gtc 156 
Tyr Arg His Ala Ser Ala His Thr Arg Arg Pro Arg Leu Arg Ala Val 
20 25 30 

etc gcg atg gcg ggc tec gac gac ccc cgt gca gcg ccc gcc aga teg 204 
Leu Ala Met Ala Gly Ser Asp Asp Pro Arg Ala Ala Pro Ala Arg Ser 
35 40 45 

gtc gcc gtc gtc ggc gcc ggg gtc age ggg etc gcg gcg gcg tac agg 252 
Val Ala Val Val Gly Ala Gly Val Ser Gly Leu Ala Ala Ala Tyr Arg 



Ala Pro Asp Gly Arg Val Leu Leu Leu 
360 365 

Asn Thr Gly lie Val Ser Lys Thr Glu 
380 

Asp Arg Asp Leu Arg Lys Met Leu He 
395 400 

Leu Val Leu Gly Val Arg Val Trp Pro 
410 415 

Val Gly His Leu Asp Leu Leu Glu Ala 
425 430 

Gly Gly Tyr Asp Gly Leu Phe Leu Gly 
440 445 

Ala Leu Gly Arg Cys Val Glu Gly Ala 
460 



Ser Asp Phe Leu Thr Lys Tyr Ala Tyr 
475 480 
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50 55 60 

etc aga cag age ggc gtg aac gta acg gtg ttc gaa gcg gcc gac agg 300 
Leu Arg Gin Ser Gly Val Asn Val Thr Val Phe Glu Ala Ala Asp Arg 
65 70 75 

gcg gga gga aag ata egg acc aat tec gag ggc ggg ttt gtc tgg gat 348 
Ala Gly Gly Lys lie Arg Thr Asn Ser Glu Gly Gly Phe Val Trp Asp 
80 85 90 95 

gaa gga get aac acc atg aca gaa ggt gaa tgg gag gcc agt aga ctg 396 
Glu Gly Ala Asn Thr Met Thr Glu Gly Glu Trp Glu Ala Ser Arg Leu 
100 105 110 

att gat gat ctt ggt eta caa gac aaa cag cag tat cct aac tec caa 444 
lie Asp Asp Leu Gly Leu Gin Asp Lys Gin Gin Tyr Pro Asn Ser Gin 
115 120 125 

cac aag cgt tac att gtc aaa gat gga gca cca gca ctg att cct teg 492 
His Lys Arg Tyr He Val Lys Asp Gly Ala Pro Ala Leu He Pro Ser 
130 135 140 

gat ecc att teg eta atg aaa age agt gtt ctt teg aca aaa tea aag 540 
Asp Pro He Ser Leu Met Lys Ser Ser Val Leu Ser Thr Lys Ser Lys 
145 150 155 

att gcg tta ttt ttt gaa cca ttt etc tac aag aaa get aac aca aga 588 
He Ala Leu Phe Phe Glu Pro Phe Leu Tyr Lys Lys Ala Asn Thr Arg 
160 165 170 175 

aac tet gga aaa gtg tet gag gag eac ttg agt gag agt gtt ggg age 636 
Asn Ser Gly Lys Val Ser Glu Glu His Leu Ser Glu Ser Val Gly Ser 
180 185 190 

ttc tgt gaa cgc cac ttt gga aga gaa gtt gtt gac tat ttt gtt gat 684 
Phe Cys Glu Arg His Phe Gly Arg Glu Val Val Asp Tyr Phe Val Asp 
195 200 205 

cca ttt gta get gga aca agt gca gga gat cca gag tea eta tet att 732 
Pro Phe Val Ala Gly Thr Ser Ala Gly Asp Pro Glu Ser Leu Ser He 
210 215 220 

cgt cat gca ttc cca gca ttg tgg aat ttg gaa aga aag tat ggt tea 780 
Arg His Ala Phe Pro Ala Leu Trp Asn Leu Glu Arg Lys Tyr Gly Ser 
225 230 235 

gtt att gtt ggt gcc ate ttg tet aag eta gca get aaa ggt gat cca 828 
Val He Val Gly Ala He Leu Ser Lys Leu Ala Ala Lys Gly Asp Pro 
240 245 250 255 

gta aag aca aga cat gat tea tea ggg aaa aga agg aat aga cga gtg 876 
Val Lys Thr Arg His Asp Ser Ser Gly Lys Arg Arg Asn Arg Arg Val 
260 265 270 

teg ttt tea ttt eat ggt gga atg cag tea eta ata aat gca ctt cac 924 
Ser Phe Ser Phe His Gly Gly Met Gin Ser Leu He Asn Ala Leu His 
275 280 285 
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aat gaa gtt gga gat gat aat gtg aag ctt ggt aca gaa gtg ttg tea 972 
Asn Glu Val Gly Asp Asp Asn Val Lys Leu Gly Thr Glu Val Leu Ser 
290 295 300 

ttg gca tgt aca ttt gat gga gtt cct gca eta ggc agg tgg tea att 1020 
Leu Ala Cys OOir Phe Asp Gly Val Pro Ala Leu Gly Arg Trp Ser lie 
305 310 315 

tct gtt gat teg aag gat age ggt gae aag gae ctt get agt aac caa 1068 
Ser Val Asp Ser Lys Asp Ser Gly Asp Lys Asp Leu Ala Ser Asn Gin 
320 325 330 335 

aec ttt gat get gtt ata atg aca get cca ttg tea aat gtc egg agg 1116 
Thr Phe Asp Ala Val lie Met Thr Ala Pro Leu Ser Asn Val Arg Arg 
340 345 350 

atg aag ttc ace aaa ggt gga get ccg gtt gtt ctt gae ttt ctt cct 1164 
Met Lys Phe Thr Lys Gly Gly Ala Pro Val Val Leu Asp Phe Leu Pro 
355 360 365 

aag atg gat tat eta eea eta tct etc atg gtg act get ttt aag aag 1212 
Lys Met Asp Tyx Leu Pro Leu Ser Leu Met Val Thr Ala Phe Lys Lys 
370 375 380 

gat gat gtc aag aaa cct ctg gaa gga ttt ggg gtc tta ata cct tac 1260 
Asp Asp Val Lys Lys Pro Leu Glu Gly Phe Gly Val Leu lie Pro Tyr 
385 390 395 

aag gaa cag caa aaa cat ggt ctg aaa ace ctt ggg act etc ttt tee 1308 
Lys Glu Gin Gin Lys His Gly Leu Lys Thr Leu Gly Thr Leu Phe Ser 
400 405 410 415 

tea atg atg ttc cca gat ega get cct gat gae caa tat tta tat aca 1356 
Ser Met Met Phe Pro Asp Arg Ala Pro Asp Asp Gin Tyr Leu Tyr Thr 
420 425 430 

aca ttt gtt ggg ggt age eac aat aga gat ctt get gga get eea acg 1404 
Thr Phe Val Gly Gly Ser His Asn Arg Asp Leu Ala Gly Ala Pro Thr 
435 440 445 

tct att ctg aaa caa ctt gtg aec tct gae ctt aaa aaa etc ttg ggc 1452 
Ser He Leu Lys Gin Leu Val Thr Ser Asp Leu Lys Lys Leu Leu Gly 
450 455 460 

gta gag ggg caa cca act ttt gtc aag eat gta tac tgg gga aat get 1500 
Val Glu Gly Gin Pro Thr Phe Val Lys His Val Tyr Trp Gly Asn Ala 
465 470 475 

ttt cct ttg tat ggc cat gat tat agt tct gta ttg gaa get ata gaa 1548 
Phe Pro Leu Tyr Gly His Asp Tyr Ser Ser Val Leu Glu Ala He Glu 
480 485 490 495 

aag atg gag aaa aac ctt cca ggg ttc ttc tac gca gga aat age aag 1596 
Lys Met Glu Lys Asn Leu Pro Gly Phe Phe Tyx Ala Gly Asn Ser Lys 
500 505 510 
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gat ggg ctt get gtt gga agt gtt ata get tea gga age aag get get 1644 
Asp Gly Leu Ala Val Gly Ser Val He Ala Ser Gly Ser Lys Ala Ala 
515 520 525 

gac ctt gca ate tea tat ett gaa tet cae ace aag cat aat aat tea 1692 
Asp Leu Ala He Ser Tyr Leu Glu Ser His Thr Lys His Asn Asn Ser 
530 535 540 

cat tga aagtgtetga eetateetct ageagttgte gaeaaattte tccagttcat 1748 
His 

545 

gtacagtaga aaecgatgeg ttgcagttte agaaeatctt eacttcttca gatattaace 1808 
cttegttgaa eateeaeeag aaaggtagtc aeatgtgtaa gtgggaaaat gaggttaaaa 1868 
aetattatgg eggeegaaat gttccttttt gttttcctca eaagtggcct acgaeacttg 1928 
atgttggaaa tacatttaaa tttgttgaat tgtttgagaa eacatgcgtg acgtgtaata 1988 
tttgectatt gtgattttag eagtagtctt ggceagatta tgctttacge etttaaaaaa 2048 
aaaaaaaaaa aaa 2061 



<210> 8 
<211> 544 
<212> PRT 
<213> Zea mays 

<400> 8 



Met 


Leu 


Ala 


Leu 


Thr 


Ala 


Ser 


Ala 


1 








5 








Arg 


His 


Ala 


Ser 

20 


Ala 


His 


Thr 


Arg 


Ala 


Met 


Ala 
35 


Gly 


Ser 


Asp 


Asp 


Pro 
40 


Ala 


Val 
50 


Val 


Gly 


Ala 


Gly 


Val 
55 


Ser 


Arg 


Gin 


Ser 


Gly 


Val 


Asn 


Val 


Thr 


65 










70 






Gly 


Gly 


Lys 


He 


Arg 
85 


Thr 


Asn 


Ser 


Gly 


Ala 


Asn 


Thr 
100 


Met 


Thr 


Glu 


Gly 


Asp 


Asp 


Leu 
115 


Gly 


Leu 


Gin 


Asp 


Lys 
120 


Lys 


Arg 
130 


Tyr 


He 


val 


Lys 


Asp 
135 


Gly 


Pro 


He 


Ser 


Leu 


Met 


Lys 


Ser 


Ser 


145 










150 






Ala 


Leu 


Phe 


Phe 


Glu 
165 


Pro 


Phe 


Leu 


Ser 


Gly 


Lys 


Val 
180 


Ser 


Glu 


Glu 


His 


Cys 


Glu 


Arg 
195 


His 


Phe 


Gly 


Arg 


Glu 
200 



Ser 


Ser 
10 


Ala 


Ser 


Ser 


His 


Pro 
15 


Tyr 


Arg 


Pro 


Arg 


Leu 


Arg 


Ala 


Val 


Leu 


25 










30 






Arg 


Ala 


Ala 


Pro 


Ala 
45 


Arg 


Ser 


Val 


Gly 


Leu 


Ala 


Ala 
60 


Ala 


Tyr 


Arg 


Leu 


Val 


Phe 


Glu 
75 


Ala 


Ala 


Asp 


Arg 


Ala 
80 


Glu 


Gly 
90 


Gly 


Phe 


Val 


Trp 


Asp 
95 


Glu 


Glu 


Trp 


Glu 


Ala 


Ser 


Arg 


Leu 


He 


105 










110 






Gin 


Gin 


Tyr 


Pro 


Asn 
125 


Ser 


Gin 


His 


Ala 


Pro 


Ala 


Leu 
140 


He 


Pro 


Ser 


Asp 


Val 


Leu 


Ser 
155 


Thr 


Lys 


Ser 


Lys 


He 
160 


Tyr 


Lys 
170 


Lys 


Ala 


Asn 


Thr 


Arg 
175 


Asn 


Leu 


Ser 


Glu 


Ser 


Val 


Gly 


Ser 


Phe 


185 










190 






Val 


Val 


Asp 


Tyr 


Phe 


Val 


Asp 


Pro 



205 
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al a m 


xnr oer Aia 


Gly Asp Pro Glu Ser 


Leu 


Ser 


He 


Arg 




z±u 






220 








His 


Ala 


iri.xxs trj, \j 


Aj.a jueu. 1 


Asn Leu Glu Argr Lys 


Tyr 


GJLy 


Ser 


Val 


225 
















240 


Tie 


Val 
vox 


fZl\r IVl a 
V9Xy AJLCl 


x±e Lieu oer 


Lys Leu Ala Ala Lys 


Gly 


Asp 


Pro 


Val 








OAR 


Oca 






255 










Aei^ Cav 

aer oer 


Gly Lys Ar^ Ar^* Asn 


Arg 


Arg 


Val 


Ser 










o cc 




270 








seiT 


Phe Hx.s 


vaxy v»xy raeii 


Gin Ser Leu lie Asn 


Ala 


Leu 


His 


Asn 










ion 


285 








Glu 


Val 




Asy /isii va± 


Lys Leu Gly Thr Glu 


Val 


Leu 


Ser 


Leu 




^ 7 u 




O Q C 


300 










Ala 


Cys 




r\sy \9±y vax 


fTo Axa LiGu LjjLy Arg 


Trp 


Ser 


He 


Ser 


305 






Tin 










320 


Val 


Asp 




Asp sec^ vv±y 


Asp Lys Asp Leu Ala 


Ser 


Asn 


Gin 


Thr 










■J Q n 






335 




Phe 


Asp 


Ala Val 


Tin Mat* TVir- 


Axa ir'ro Lieu oer Asn 


Val 


Arg 


Arg 


Met 






J 4 w 








350 




Jjys 






V9±y vsxy Axa 


Dv"A TTa1 17^1 T 1i MM 

Fro vaJL vai Leu Asp 


Phe 


Leu 


Pro 


Lys 










•a ^ A 

360 


365 






Met 






irro Lieu oer 


Lieu wet vai Tnr Ala 


Phe 


Lys 


Lys 


Asp 




7*7 ft 




375 


380 






Asp 






^1«4 

irro Jbeu ui±U 


Gxy Pne Gly Val Leu 


He 


Pro 


Tyr 


Lys 


J OS 






^ on 


one 

395 






400 


Gill 


19 J. XI 


la J.I1 uy s 


ms V9xy Lieu 


Lys Thr Leu Gly Thr 


Leu 


Phe 


Ser 


Ser 










yi 1 A 
410 






415 




Met: 


Met 


PVlA P>"n 


Asp ALg ALa 


Pro Asp Asp Gin Tyr 


Leu 


Tyr 


Thr 


Thr 






4Z u 




425 




430 






Phe 


Val 


Glv Gl V 


OCi nj.5 ASIi 


Arg Asp Leu Ala Gly 


Ala 


Pro 


Thr 


Ser 






^ J ;j 




A A f\ 


445 








lie 




jjys V9J.IX 


Lieu vax inr 


Ser Asp Leu Lys Lys 


Leu 


Leu 


Gly 


Val 










460 








Glu 


Gly 


Gin Pro 


Thr Phe Val 


Lys His Val Tyr Trp 


Gly 


Asn 


Ala 


Phe 


465 






470 


475 






480 


Pro 


Leu 


Tyr Gly 


His Asp Tyr 


Ser Ser Val Leu Glu 


Ala 


lie 


Glu 


Lys 








485 


490 






495 


Met 


Glu 


Lys Asn 


Leu Pro Gly 


Phe Phe Tyr Ala Gly 


Asn 


Ser 


Lys 


Asp 






500 




505 




510 


Gly 


Leu 


Ala Val 


Gly Ser Val 


He Ala Ser Gly Ser 


Lys 


Ala 


Ala 


Asp 






515 




520 


525 






Leu 


Ala 


He Ser 


Tyr Leu Glu 


Ser His Thr Lys His 


Asn 


Asn 


Ser 


His 




530 




535 


540 











<210> 9 
.<211> 1811 
<212> DNA 

<213> Triticum aestivuxn 

<220> 

<221> CDS 

<222> (3).. (1589) 

<223> wheat protox-1 

<400> 9 

gc gca aca atg gcc acc gcc acc gtc gcg gcc gcg teg ccg etc ege 47 
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Ala Thr Met Ala Thr Ala Thr Val Ala Ala Ala Ser Pro Leu Arg 
15 10 15 

ggc agg gtc acc ggg cgc cca cac cgc gtc cgc ccg cgt tgc get acc 95 
Gly Arg Val Thr Gly Arg Pro His Arg Val Arg Pro Arg Cys Ala Thr 
20 25 30 

gcg age age gcg aee gag aet ccg geg geg ccc ggc gtg egg ctg tec 143 
Ala Ser Ser Ala Thr Glu Thr Pro Ala Ala Pro Gly Val Arg Leu Ser 
35 40 45 

gcg gaa tgc gtc att gtg ggc gcc ggc ate age ggc etc tgc acc gcg 191 
Ala Glu Cys Val He Val Gly Ala Gly He Ser Gly Leu Cys Thr Ala 
50 55 60 

cag geg ctg gcc acc ega tac ggc gtc age gac ctg etc gtc aeg gag 239 
Gin Ala Leu Ala Thr Arg Tyr Gly Val Ser Asp Leu Leu Val Thr Glu 
65 70 75 

gcc cgc gac cgc ccg ggc ggc aac ate acc acc gtc gag cgt ccc gac 287 
Ala Arg Asp Arg Pro Gly Gly Asn He Thr Thr Val Glu Arg Pro Asp 
80 85 90 95 

gag- ggg tac ctg tgg gag gag gga ccc aac age ttc cag ccc tec gac 335 
Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp 
100 105 110 

ccg gtc etc acc atg gcc gtg gac age ggg etc aag gat gac ttg gtg 383 
Pro Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp Leu Val 
115 120 125 

ttc ggg gac ccc aac gcg ccc egg ttc gtg ctg tgg gag ggg aag ctg 431 
Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly Lys Leu 
130 135 140 

agg ccg gtg ccg teg aag cca ggc gac ctg cct ttc ttc age etc atg 479 
Arg Pro Val Pro Ser Lys Pro Gly Asp Leu Pro Phe Phe Ser Leu Met 
145 150 155 

agt ate cct ggg aag etc agg gcc ggc ctt ggc gcg etc ggc att cgc 527 
Ser He Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu Gly He Arg 
160 165 170 175 

cca cct cct cca ggg cgc gag gag teg gtg gag gag ttt gtg cgc cgc 575 
Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe Val Arg Arg 
180 185 190 

aac etc ggt gcc gag gtc ttt gag cgc etc ate gag cct ttc tgc tea 623 
Asn Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser 
195 200 205 

ggt gta tat get ggt gat cct teg aag ctt agt atg aag get gca ttt 671 
Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe 
210 215 220 

ggg aag gtc tgg agg ttg gag gag att gga ggt agt att att ggt gga 719 
Gly Lys Val Trp Arg Leu Glu Glu He Gly Gly Ser He He Gly Gly 
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225 






230 










235 






acc 
Thr 
240 


ate 
He 


aag 
Lys 


gcg 
Ala 


att cag gat 
He Gin Asp 
245 


aaa 
Lys 


ggg 

Gly 


aag 
Lys 


aac 
Asn 
250 


ccc aaa 
Pro Lys 


ccg cca agg 
Pro Pro Arg 
255 


767 


gat 
Asp 


ccc 
Pro 


cga 
Arg 


ctt 
Leu 


ccg gea cca 
Pro Ala Pro 
260 


aag 
Lys 


gga 

Gly 


cag 
Gin 
265 


acg 
Thr 


gtg gea 
Val Ala 


tct ttc agg 
Ser Phe Arg 
270 


815 


aag 
Lys 


ggt 

Gly 


eta 
Leu 


gee 
Ala 
275 


atg etc ccg 
Met Leu Pro 


aat 
Asn 


gee 
Ala 
280 


ate 
He 


gea 
Ala 


tct agg 
Ser Arg 


ctg ggt agt 
Leu Gly Ser 
285 


863 


aaa 
Lys 


gtc 
Val 


aag 
Lys 
290 


ctg 
Leu 


tea tgg aag 
Ser Trp Lys 


ctt 
Leu 
295 


acg 
Thr 


age 
Ser 


att 
He 


aca aag 
Thr Lys 
300 


gcg gac aac 
Ala Asp Asn 


911 


caa 
Gin 


gga 
Gly 
305 


tat 
Tyr 


gta 
Val 


tta ggt tat 
Leu Gly Tyr 
310 


gaa 
Glu 


aca 
Thr 


cca 
Pro 


gaa 
Glu 


gga ctt 
Gly Leu 
315 


gtt tea gtg 
Val Ser Val 


959 


cag 
Gin 
320 


get 
Ala 


aaa 
Lys 


agt 
Ser 


gtt ate atg 
Val He Met 
325 


acc 
Thr 


ate 
He 


ccg 
Pro 


tea 
Ser 
330 


tat gtt 
Tyr Val 


get agt gat 
Ala Ser Asp 
335 


1007 


ate 
lie 


ttg 
Leu 


cgc 
Arg 


eca 
Pro 


.ctt tea att 
Leu Ser He 
340 


gat 
Asp 


gea 
Ala 


gea 
Ala 
345 


gat 
Asp 


gea etc 
Ala Leu 


tea aaa ttc 
Ser Lys Phe 
350 


1055 


tat 
Tyr 


tat 
Ty^^ 


ccg 
Pro 


eca 
Pro 
355 


gtt get get 
Val Ala Ala 


gta 
Val 


act 
Thr 
360 


gtt 
Val 


tea 
Ser 


tat eca 
Tyr Pro 


aaa gaa get 
Lys Glu Ala 
365 


1103 


att 
He 


aga 
Arg 


aaa 
Lys 
370 


gaa 
Glu 


tge tta att 
Cys Leu He 


gat 
Asp 
375 


ggg 

Gly 


gag 
Glu 


etc 
Leu 


cag ggt 
Gin Gly 
380 


ttc ggc cag 
Phe Gly Gin 


1151 


ttg 
Leu 


cat 
His 
385 


cca 
Pro 


egt 
Arg 


age caa gga 
Ser Gin Gly 
390 


gtc 
Val 


gag 
Glu 


act 
Thr 


tta 
Leu 


ggg aca 
Gly Thr 
395 


ata tat age 
He Tyr Ser 


1199 


tct 
Ser 
400 


tct 
Ser 


etc 
Leu 


ttt 
Phe 


cet aat egt 
Pro Asn Arg 
405 


get 
Ala 


cet 
Pro 


get 
Ala 


gga 

Gly 
410 


aga gtg 
Arg Val 


tta ctt ctg 
Leu Leu Leu 
415 


1247 


aac 
Asn 


tat 
Tyr 


ate 
He 


Gly 


crcrt tc?l" ar'A 
Gly Ser Thr 
420 


CLCL L. 

Asn 


aca 
Thr 


ggg 

Gly 
425 


He 


guc ccc 
Val Ser 


aag act gag 
Lys Thr Glu 
430 


1 o o c 

1295 


agt 
Ser 


gac 
Asp 


tta 
Leu 


gta 
Val 
435 


gga gee gtt 
Gly Ala Val 


gac 
Asp 


egt 
Arg 
440 


gac 
Asp 


etc 
Leu 


aga aaa 
Arg Lys 


atg ttg ata 
Met Leu He 
445 


1343 


aac 
Asn 


act 
Pro 


aga 
Arg 
450 


gca 
Ala 


gea gac cet 
Ala Asp Pro 


tta 

Leu 
455 


gca 
Ala 


tta 
Leu 


ggg 

Gly 


gtt cga 
Val Arg 
460 


gtg tgg cca 
Val Trp Pro 


1391 
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caa gca ata cca cag ttt ttg att ggg cac ctt gat cgc ctt get get 1439 
Gin Ala lie Pro Gin Phe Leu lie Gly His Leu Asp Arg Leu Ala Ala 
465 470 475 

gca aaa tct gea ctg gge caa ggc ggc tac gae ggg ttg ttc eta gga 1487 
/Via Lys Ser Ala Leu Gly Gin Gly Gly Tyr Asp Gly Leu Phe Leu Gly 
480 485 490 495 

gga aac tac gtc gca gga gtt gee ttg ggc cga tgc ate gag ggt gcg 1535 
Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys lie Glu Gly Ala 
500 505 510 

tac gag agt gcc tea caa gta tct gac ttc ttg ace aag tat gee tac 1583 
Tyr Glu Ser Ala Ser Gin Val Ser Asp Phe Leu Thr Lys Tyr Ala Tyr 
515 520 525 

aag tga tggaagtagt gcatctcttc attttgttgc atatacgagg tgaggctagg 1639 
Lys 

ateggtaaaa catcatgaga ttctgtagtg tttctttaat tgaaaaaaca aattttagtg 1699 
atgcaatatg tgetetttcc tgtagttcga gcatgtacat cggtatggga taaagtagaa 1759 
taagetattc tgcaaaagea gtgatttttt ttgaaaaaaa aaaaaaaaaa aa 1811 



<210> 10 
<211> 528 
<212> PRT 

<213> Triticum aestivum 
<400> 10 



Ala 


Thr 


Met 


Ala 


Thr 


Ala 


Thr 


Val 


Ala 


Ala 


Ala Ser Pro 


Leu 


Arg Gly 


1 








5 










10 






15 


Arg 


Val 


Thr 


Gly 


Arg 


Pro 


His 


Arg 


Val 


Arg 


Pro Arg Cys 


Ala 


Thr Ala 








20 










25 






30 




Ser 


Ser 


Ala 


Thr 


Glu 


Thr 


Pro 


Ala 


Ala 


Pro 


Gly Val Arg 


Leu 


Ser Ala 






35 










40 






45 






Glu 


Cys 


Val 


He 


Val 


Gly 


Ala 


Gly 


He 


Ser 


Gly Leu Cys 


Thr 


Ala Gin 




50 










55 








60 






Ala 


Leu 


Ala 


Thr 


Arg 


Tyr 


Gly 


Val 


Ser 


Asp 


Leu Leu Val 


Thr 


Glu Ala 


65 










70 










75 




80 


Arg 


Asp 


Arg 


Pro 


Gly 


Gly 


Asn 


lie 


Thr 


Thr 


Val Glu Arg 


Pro 


Asp Glu 










85 










90 






95 


Gly 


Tyr 


Leu 


Trp 


Glu 


Glu 


Gly 


Pro 


Asn 


Ser 


Phe Glri Pro 


Ser 


Asp Pro 








100 










105 






110 




Val 


Leu 


Thr 


Met 


Ala 


Val 


Asp 


Ser 


Gly 


Leu 


Lys Asp Asp 


Leu 


Val Phe 






115 










120 






125 






Gly 


Asp 


Pro 


Asn 


Ala 


Pro 


Arg 


Phe 


Val 


Leu 


Trp Glu Gly 


Lys 


Leu Arg 




130 










135 








140 






Pro 


Val 


Pro 


Ser 


Lys 


Pro 


Gly 


Asp 


Leu 


Pro 


Phe Phe Ser 


Leu 


Met Ser 


145 










150 










155 




160 


lie 


Pro 


Gly 


Lys 


Leu 


Arg 


Ala 


Gly 


Leu 


Gly 


Ala Leu Gly 


He 


Arg Pro 










165 










170 






175 


Pro 


Pro 


Pro 


Gly 


Arg 


Glu 


Glu 


Ser 


Val 


Glu 


Glu Phe Val 


Arg 


Arg Asn 








180 










185 






190 
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L6U 




Ala 


GJLU 


va± pne 


Glu Arg 


Leu 


He 


Glu 


Pro 


Pne 


Cys 


Ser 


Gly 






























vox 


Tyr 




vsxy 


ASp irxO 


Ser Lys 


Leu 


Ser 


neu 


Lys 


Aia 


Ik 1 0 
Aia 


Pne 


Gly 




Z J.U 
















0 1 n 












Vax 


Trp 


Arg 


Leu GXu 


laXU ±±G 


Giy 


Giy 


Ser 


lie 


Tl A 

lie 


Giy 


Gly 


Thr 




























240 


Tl P 






XXc 




Ltys uxy 


Lys 


Asn 


IrxO 


Lys 


XrxO 


Pro 


Arg 


Asp 


























0 c c 






Arg 


Leu 


Pro 


AX a irXQ 


Lys Gly 


oxn 


Thr 


vai 


al a 

Axa 


oer 


pne 


Arg 


Lys 














Zoo 










OTA 
2/0 










AT a 




1j6U JrxO 


/\Sn HJLcL 


Tie* 

lie 


Ala 


Ser 


Arg 


Leu 


Giy 


Ser 


Lys 






z /o 






0 Q n 










00c 

ZOD 






VCI.L 


Lys 


Leu 


Ser 


xrp Jjys 


j-icU inr 


Ser 


Tl « 

lie 


inr 


Lys 


Ala 


Asp 


Asn 


Gin 




0 Q n 








0 Q C 








0 A A 
JOO 












Tyr 




Leu 


Gly Tyr 


Glu Thr 


Pro 


Glu 


Gly 


Leu 


Val 


Ser 


Val 


Gin 


















one 
Jl3 










320 


ivLa 


Lys 


Ser 


VaJ. 


JLJLe met. 


Thr He 


Pro 


Ser 


l^rr 


vai 


Ala 


Ser 


Asp 


He 
















*a T n 










335 






Arg 




Leu 


oer xxe 


Asp Ala 


Ala 


ASp 


Ala 


Leu 


Ser 


Lys 


Pne 


■^nc 








*J >i n 






0 >l c 










350 






Tyr 


Pro 


Pro 


vai 


Axa Axa 


vai Tnr 


val 


Ser 




Pro 


Lys 


Glu 


Ala 


He 






"3 c c 
OD3 






3o0 










365 










Lys 


19XU 


cys 


Leu xxe 


Asp Gly 


Glu 


Leu 


Gin 


Gly 


Phe 


Gly 


Gin 


Leu 




370 








375 








380 










His 


Pro 


Arg 


Ser 


Gin Gly 


Val Glu 


Thr 


Leu 


Gly 


Thr 


He 


Tyr 


Ser 


Ser 










390 








395 










400 


Ser 


Leu 


Pne 


Pro 


Asn Arg 


Ala Pro 


Ala 


Gly 


Arg 


Val 


Leu 


Leu 


Leu 


Asn 










405 






410 










415 




Tyr 




^xy 


vjiy 


Ser Thr 


Asn Thr 


Gly 


He 


Val 


Ser 


Lys 


Thr 


Glu 


Ser 








420 






425 










430 








Leu 


Val 


Glv 


Ala Val 




A en 


XJCU 


At'^ 

iurg 






Leu 


Tl 0 

xxe 


Asn 






435 






440 










445 








Pro 


Arg 


Ala 


Ala 


Asp Pro 


Leu Ala 


Leu 


Gly 


Val 


Arg 


Val 


Trp 


Pro 


Gin 




450 








455 








460 










Ala 


He 


Pro 


Gin 


Phe Leu 


He Gly 


His 


Leu 


Asp 


Arg 


Leu 


Ala 


Ala 


Ala 


465 








470 








475 










480 


Lys 


Ser 


Ala 


Leu 


Gly Gin 


Gly Gly 


Tyr 


Asp 


Gly 


Leu 


Phe 


Leu 


Gly 


Gly 










485 






490 










495 


Asn 


Tyr 


Val 


Ala 


Gly Val 


Ala Leu 


Gly 


Arg 


Cys 


He 


Glu 


Gly 


Ala 


Tyr 








500 






505 










510 






Glu 


Ser 


Ala 


Ser 


Gin Val 


Ser Asp 


Phe 


Leu 


Thr 


Lys 




Ala 


Tyr 


Lys 



515 520 525 



<210> 11 
<211> 1847 
<212> DMA 
<213> Glycine max 

<220> 
<221> CDS 

<222> (55).. (1683) 
<223> soybean protox-1 

<400> 11 

ctttagcaca gtgttgaaga taacgaacga atagtgccat tactgtaacc aacc atg 57 
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Met 
1 

gtt tec gtc ttc aac gag ate eta ttc ceg eeg aac caa ace ctt ctt 105 
Val Ser Val Phe Asn Glu lie Leu Phe Pro Pro Asn Gin Thr Leu Leu 
5 10 15 

cgc ccc tec etc cat tec cca ace tct ttc ttc acc tct ccc act cga 153 
Arg Pro Ser Leu His Ser Pro Thr Ser Phe Phe Thr Ser Pro Thr Arg 
20 25 30 

aaa ttc cot cgc tct cgc cct aac cct att eta cgc tgc tec att gcg 201 
Lys Phe Pro Arg Ser Arg Pro Asn Pro lie Leu Arg Cys Ser He Ala 
35 40 45 

gag gaa tec acc gcg tct ceg ccc aaa acc aga gac tec gee ccc gtg 249 
Glu Glu Ser Thr Ala Ser Pro Pro Lys Thr Arg Asp Ser Ala Pro Val 
50 55 60 65 

gac tgc gtc gtc gtc ggc gga ggc gtc age ggc etc tgc ate gee cag 297 
Asp Cys Val Val Val Gly Gly Gly Val Ser Gly Leu Cys He Ala Gin 
70 75 80 

gee etc gee acc aaa eac gee aat gee aac gtc gtc gtc acg gag gee 345 
Ala Leu Ala Thr Lys His Ala Asn Ala Asn Val Val Val Thr Glu Ala 
85 90 95 

cga gac cgc gtc ggc ggc aac ate acc acg atg gag agg gac gga tac 393 
Arg Asp Arg Val Gly Gly Asn He Thr Thr Met Glu Arg Asp Gly Tyr 
100 105 110 

etc tgg gaa gaa ggc ccc aac age ttc cag cct tct gat cca atg etc 441 
Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Pro Met Leu 
115 120 125 

acc atg gtg gtg gac agt ggt tta aag gat gag ctt gtt ttg ggg gat 489 
Thr Met Val Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu Gly Asp 
130 135 140 145 

cct gat gea cct egg ttt gtg ttg tgg aac agg aag ttg agg ceg gtg 537 
Pro Asp Ala Pro Arg Phe Val Leu Trp Asn Arg Lys Leu Arg Pro Val 
150 155 160 

ccc ggg aag ctg act gat ttg cct ttc ttt gac ttg atg age att ggt 585 
Pro Gly Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Ser He Gly 
165 170 175 

ggc aaa ate agg get ggc ttt ggt gcg ctt gga att egg cct cct cct 633 
Gly Lys He Arg Ala Gly Phe Gly Ala Leu Gly He Arg Pro Pro Pro 
180 185 190 

cca ggt cat gag gaa teg gtt gaa gag ttt gtt cgt egg aac ctt ggt 681 
Pro Gly His Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly 
195 200 205 

gat gag gtt ttt gaa egg ttg ata gag cct ttt tgt tea ggg gtc tat 729 
Asp Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser Gly Val Tyr 
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210 215 220 225 

gca ggc gat cct tea aaa tta agt atg aaa gca gca ttc ggg aaa gtt 777 
Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly Lys Val 
230 235 240 



tgg aag ctg gaa aaa aat ggt ggt age att att ggt gga act ttc aaa 
Trp Lys Leu Glu Lys Asn Gly Gly Ser lie lie Gly Gly Thr Phe Lys 
245 250 255 



825 



gca ata caa gag aga aat gga get tea aaa cca cct cga gat ccg cgt 873 
Ala lie Gin Glu Arg Asn Gly Ala Ser Lys Pro Pro Arg Asp Pro Arg 
260 265 270 

ctg cca aaa cca aaa ggt cag act gtt gga tct ttc egg aag gga ctt 921 
Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys Gly Leu 
275 280 285 

ace atg ttg cct gat gea att tet gee aga eta ggc aae aaa gta aag 969 
Thr Met Leu Pro Asp Ala He Ser Ala Arg Leu Gly Asn Lys Val Lys 
290 295 300 305 

tta tct tgg aag ctt tea agt att agt aaa ctg gat agt gga gag tac 1017 
Leu Ser Trp Lys Leu Ser Ser He Ser Lys Leu Asp Ser Gly Glu Tyr 
310 315 320 

agt ttg aca tat gaa aca cca gaa gga gtg gtt tct ttg cag tge aaa 1065 
Ser Leu Thr Tyr Glu Thr Pro Glu Gly Val Val Ser Leu Gin Cys Lys 
325 330 335 

act gtt gtc ctg acc att cct tec tat gtt get agt aca ttg ctg cgt 1113 
Thr Val Val Leu Thr He Pro Ser Tyr Val Ala Ser Thr Leu Leu Arg 
340 345 350 

cet ctg tet get get get gca gat gea ctt tea aag ttt tat tae ect 1161 
Pro Leu Ser Ala Ala Ala Ala Asp Ala Leu Ser Lys Phe Tyr Tyr Pro 
355 360 365 

eea gtt get gea gtt tec ata tec tat cca aaa gaa get att aga tea 1209 
Pro Val Ala Ala Val Ser He Ser Tyr Pro Lys Glu Ala He Arg Ser 
370 375 380 385 

gaa tgc ttg ata gat ggt gag ttg aag ggg ttt ggt caa ttg cat cca 1257 
Glu Cys Leu He Asp Gly Glu Leu Lys Gly Phe Gly Gin Leu His Pro 
390 395 400 

cgt age caa gga gtg gaa aca tta gga act ata tac age tea tea eta 1305 
Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser Ser Leu 
405 410 415 

ttc ece aac ega gea cca cct gga agg gtt eta etc ttg aat tae att 1353 
Phe Pro Asn Arg Ala Pro Pro Gly Arg Val Leu Leu Leu Asn Tyr He 
420 425 430 

gga gga gca act aat act gga att tta teg aag aeg gae agt gaa ctt 1401 
Gly Gly Ala Thr Asn Thr Gly He Leu Ser Lys Thr Asp Ser Glu Leu 
435 440 445 
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gtg gaa aca gtt gat cga gat ttg agg aaa ate ctt ata aac cca aat 1449 
Val Glu Thr Val Asp Arg Asp Leu Arg Lys He Leu He Asn Pro Asn 
450 455 460 465 

gcc cag gat cca ttt gta gtg ggg gtg aga ctg tgg cct caa get att 1497 
Ala Gin Asp Pro Phe Val Val Gly Val Arg Leu Trp Pro Gin Ala He 
470 475 480 

cca cag tte tta gtt ggc cat ctt gat ctt eta gat gtt get aaa get 1545 
Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Asp Val Ala Lys Ala 
485 490 495 

tct ate aga aat act ggg ttt gaa ggg etc tte ctt ggg ggt aat tat 1593 
Ser He Arg Asn Thr Gly Phe Glu Gly Leu Phe Leu Gly Gly Asn Tyr 
500 505 510 

gtg tct ggt gtt gee ttg gga cga tge gtt gag gga gee tat gag gta 1641 
Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Val 
515 520 525 

gca get gaa gta aac gat ttt etc aca aat aga gtg tac aaa 1683 
Ala Ala Glu Val Asn Asp Phe Leu Thr Asn Arg Val Tyr Lys 
530 535 540 

tagtagcagt ttttgttttt gtggtggaat gggtgatggg aetctcgtgt tccattgaat 1743 

tataataatg tgaaagtttc tcaaattcgt tcgataggtt tttggcggct tctattgctg 1803 

ataatgtaaa atcctcttta agtttgaaaa aaaaaaaaaa aaaa 1847 



<210> 12 

<211> 543 

<212> PRT 

<213> Glycine max 

<400> 12 

Met Val Ser Val Phe Asn Glu He 
1 5 

Leu Arg Pro Ser Leu His Ser Pro 
20 

Arg Lys Phe Pro Arg Ser Arg Pro 
35 40 

Ala Glu Glu Ser Thr Ala Ser Pro 
50 55 

Val Asp Cys Val Val Val Gly Gly 
65 70 

Gin Ala Leu Ala Thr Lys His Ala 
85 

Ala Arg Asp Arg Val Gly Gly Asn 



Leu Phe 
10 

Thr Ser 

25 

Asn Pro 

Pro Lys 

Gly Val 

Asn Ala 
90 

He Thr 



Pro Pro 

Phe Phe 

He Leu 

Thr Arg 
60 

Ser Gly 
75 

Asn Val 
Thr Met 



Asn Gin 

Thr Ser 
30 

Arg Cys 
45 

Asp Ser 
Leu Cys 
Val Val 
Glu Arg 



Thr Leu 
15 

Pro Thr 

Ser He 

Ala Pro 

He Ala 
80 

Thr Glu 
95 

Asp Gly 
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100 



105 



110 



Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Pro Met 
115 120 125 

Leu Thr Met Val Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu Gly 
130 135 140 

Asp Pro Asp Ala Pro Arg Phe Val Leu Trp Asn Arg Lys Leu Arg Pro 
145 150 155 160 

Val Pro Gly Lys Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Ser He 
165 170 175 

Gly Gly Lys He Arg Ala Gly Phe Gly Ala Leu Gly He Arg Pro Pro 
180 185 190 

Pro Pro Gly His Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu 
195 200 205 

Gly Asp Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser Gly Val 
210 215 220 

Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala T^a Phe Gly Lys 
225 230 235 240 

Val Trp Lys Leu Glu Lys Asn Gly Gly Ser He He Gly Gly Thr Phe 
245 250 255 

Lys Ala He Gin Glu Arg Asn Gly Ala Ser Lys Pro Pro Arg Asp Pro 
260 265 270 

Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys Gly 
275 280 285 

Leu Thr Met Leu Pro Asp Ala He Ser Ala Arg Leu Gly Asn Lys Val 
290 295 300 

Lys Leu Ser Trp Lys Leu Ser Ser He Ser Lys Leu Asp Ser Gly Glu 
305 310 315 320 

Tyr Ser Leu Thr Tyr Glu Thr Pro Glu Gly Val Val Ser Leu Gin Cys 
325 330 335 

Lys Thr Val Val Leu Thr He Pro Ser Tyr Val Ala Ser Thr Leu Leu 
340 345 350 

Arg Pro Leu Ser Ala Ala Ala Ala Asp Ala Leu Ser Lys Phe Tyr Tyr 
355 360 365 

Pro Pro Val Ala Ala Val Ser He Ser Tyr Pro Lys Glu Ala He Arg 
370 375 380 

Ser Glu Cys Leu He Asp Gly Glu Leu Lys Gly Phe Gly Gin Leu His 
385 390 395 400 



Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser Ser 
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405 



410 



415 



Leu Phe Pro Asn Arg Ala Pro Pro Gly Arg Val Leu Leu Leu Asn Tyr 

420 425 430 

lie Gly Gly Ala Thr Asn Thr Gly He Leu Ser Lys Thr Asp Ser Glu 
435 440 445 

Leu Val Glu Thr Val Asp Arg Asp Leu Arg Lys He Leu He Asn Pro 
450 455 460 

Asn Ala Gin Asp Pro Phe Val Val Gly Val Arg Leu Trp Pro Gin Ala 
465 470 475 480 

He Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu Asp Val Ala Lys 
485 490 495 

Ala Ser He Arg Asn Thr Gly Phe Glu Gly Leu Phe Leu Gly Gly Asn 
500 505 510 

Tyr Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu 
515 520 525 



Val Ala Ala Glu Val Asn Asp Phe Leu Thr Asn Arg Val Tyr Lys 
530 535 540 



<210> 13 
<211> 583 
<212> DNA 

<213> Arabidopsis tha liana 

<220> 

<221> misc_feature 
<222> (1) ..(583) 

<223> arabidopsis protox-1 promoter 



<400> 13 

gaattccgat cgaattatat aattatcata 
agaggtttaa taaagtttgg taataatgga 
taattaatat ttacatcaaa atttggtcac 
taattcgca.a ataaaacact aattccaaat 
cttgataaag caaagcaaaa ataatgggtt 
aaaaaaggtt tggttatata tctattgggc 
actaaaataa taiaaataaac gtaatggtcc 
acccaaacca aagaaaaagt atacggtacg 
ggtgaatatt tctcgtcgtc ttctcctttc 
accaagaagc tgacaaaatt ccgaattctc 



aatttgaata agcatgttgc cttttattaa 60 
ctttgacttc aaactcgatt ctcatgtaat 120 
taatattacc aaattaatat actaaaatgt 180 
aaagggtcat tatgataaac acgtattgaa 240 
tcaaggtttg ggttatatat gacaaaaaaa 300 
ctataaccat gttatacaaa tttgggccta 360 
tttttatatt tgggtcaaac ccaactctaa 420 
gtacacagac ttatggtgtg tgtgattgca 480 
ttctgaagaa gattacccaa tctgaaaaaa 540 
tgcgatttcc atg 583 



<210> 14 
<211> 3848 
<212> DNA 
<213> Zea mays 



<220> 
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<221> inisc_f eature 

<222> (1) . . (3848) 

<223> maize protox-1 promoter 

<400> 14 

tcgatctttc taggctgatc cccaaatctt 
ggagctggtg gcctgaaaga gctttgctgt 
ctctgagact gacttccttt gtcgtcactt 
ctcagatgga ttcttcctcc gaagcccctg 
cttctttggc gaaaatctgt cagcttggat 
cagagtttgt ggaggcttcc tggcgaaata 
gatcatggcc tcaatgacaa tctcattggg 
gaaccttcgt acatatgcct gaaggtattc 
ctgagctgtg accgacttcg tttgaaagcc 
cttctgccac gacgtgatag tccctggccg 
ccggactgcc atgacgaagg acttcgccat 
agttgcttcg tagctcatca gaaactgctt 
ctgaggtggc ttgtatgatg ggggccatgg 
agcatcatca aaagtaaagg catcatgatt 
taagccttct tgacgaagct ccctgtgttg 
atgacgcact tcttcagtgg cttcgtcgat 
cttctccttc tttctttgta cttgttgatg 
caactcctcc tcttggagtg tcagactggt 
aagagaaaga gtttcttgat ttgggtccag 
agctttcttc ggtggcatga caaaggtcag 
ctagaggtgg gagccaatgt tggggacttc 
acaaaatgtt aaatattaat agctttcatc 
tgatcttcag acgaaagagt ccttcatcat 
atgaaatgta agagacaaca tgaacaatcg 
tattatggaa aaatagaaac aatattgaat 
aaaagtacaa gcttgacgca cgagcaagta 
ttcatctatt tataggcaca ggacacagcc 
ttactattga cttatagaaa aatctatgag 
tgcctttttc cgcgattaag ccgaatctcc 
ttcgtcacga tcatgccctt ctcattgtgt 
gtccataaac catacttgga agacattgtt 
gaaggccccc aacagtcgtg tttttgagga 
ctatccataa aaccaaccta tccacaaaac 
aacaacccta attaggttgt tggtttaaat 
actgtcactc cacaaactca atatcaataa 
cataaaaccg ccccaccctt ctagcgcctc 
caaacttaaa acgaccataa ctttcacctt 
atcacacaaa attaaatttc gcatccgata 
tgttgctcac actagtgtat ttatggacta 
cagtacatct aagttgttac tcaattacca 
tcgactagtc actcaattac caaaactaaa 
gacactgtac tggactaaac cacctttcaa 
cgtagttgta ggagctaaag tatatgtcca 
ttaaaagtcc ttttacctct tgaaactttt 
aaaatttgac attttatcac cccttaactc 
attatagatg attttgttgt gaaaagtttt 
tttgttcaat ttcctagagt taaatctaat 
agctctaaat atttttattt tttcattatg 
tttgtggttt aaaagccttg ccatgttttt 
tcttggaaac cacttctaac ccggtagaag 
aaatcaactt atgaaattgt cttggaaact 
aaaattaaac caacttacgg aatcgcccaa 



cctccgaagc ccctggcgcc tctgcccctt 60 
tgccccgaag attgtgaggt atattgtgac 120 
tgagtggagt tatggattga cctgacgtgc 180 
gtcatttcgg agaatctgta atcttattcc 240 
gtactcatcc atcttctgaa gcagcttctc 300 
ttgggctgta ggtcctggac gaagaccctt 360 
caccgtaggc gcttgtgccc tcaatcgcaa 420 
ttcgtgatct tgtgtgcatt ggaacagagc 480 
ttggaagcta gtaaccaaca tgtgcttaag 540 
aagagaagaa taccatgttt gggctacatt 600 
gactacagtg ttgaccccat acgaagatat 660 
tggatctgag tgcccatcat acatggggag 720 
ggtagcctgc agttctgctg ccaagggaga 780 
aaaatcatca taccatccat cctcgttgaa 840 
gggccttcga tcttgttcat cttgaacaag 900 
ctttctttgg agatcagcca gtcgcaccat 960 
gatgatctcc atgtccctga tctcttggtc 1020 
ggctttcctc ttctggcttc gagcctctcg 1080 
cggctgcagt gcagtggtcc ctggtgctga 1140 
tgcttgccga aggtggtcga aaagggttca 1200 
tcaagtgcta tgagttaaga acaaggcaac 1260 
tttcgaagca ttatttccct ttgggtataa 1320 
tgcgatatat gttaatagaa ggaggagcat 1380 
tgtagcattg ttaattcatc atcattttat 1440 
tacaaatgta cctttggctt gacagaagat 1500 
caagtcagtg tgaacagtac gggggtactg 1560 
tgtgagaaat tacagtcatg ccctttacat 1620 
gactggatag ccttttcccc tttaagtcgg 1680 
cttgcgcata gcttcggagc atcggcaacc 1740 
atgcttttaa tcctgaattc gaaggtacct 1800 
aaattatgtt tttgaggacc ttcggaggac 1860 
ccttcggaag atgaaggccc ccaacaagac 1920 
cgaccccatt cacccttcat ttgcctcacc 1980 
tttttagggt caatttggtc atcaccatcc 2040 
acagactcaa tcacccaaac tgaccatacc 2100 
gccagaaacc agaaaccctg attcagagtt 2160 
ggaactcgaa tcaggtccat ttttttccaa 2220 
atcaagccat ctcttcacta tggttttaag 2280 
atcacctgtg tatctcatac aataacatat 2340 
aaaccgaatt atagccttcg aaaaaggtta 2400 
ctttagactt tcatgtatga catccaacat 2460 
gctacacaag gagcaaaaat aactaatttt 2520 
caacaatagt taagggaagc ccccaaggac 2580 
gtcgtggtct actttttcac tttaaacttc 2640 
ttaaaaccat ttaaattaca ttcttactag 2700 
taagacatgt ttacacattg attaaaatca 2760 
cttattaaaa ctattagaga tactttcacg 2820 
gaattttgtt agaattctta tagacctttt 2880 
aacaagtttt ttttctattt tttgaaattt 2940 
atttattttg ctacacttat atctacaaca 3000 
acctctaacc cggtagaatg aatttgaatg 3060 
catatgtcga ttaaagtgga tatggataca 3120 
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tatgaagaag ccctagagat aatctaaatg 
ttgatgggaa gataagacca taacggtagt 
aaatatttgt gctgcaattg atcctgtgcc 
ctagagcgaa caaggcccac gtcacccgtg 
ctttgagagg gattggatat caacggaacc 
acctgtaacg ttccagtggg ccatccttaa 
tcgtgtcatc cactccgccg cacaggcgct 
cgccacagcc accgccatgg ccaccgctgc 
tgcgcggctc cgccatcgag gactcagcgt 
cgaggcaccg gcatccaccg gcgcgcggct 
catcagtggc ctctgcaccg cgcaggcgct 
tgtcacggag gcccgcgccc gccccggcgg 
agggtacc 



gtttcagaat tgagggttat tttttgaagt 3180 
tcacagagat aaaagggtta tttttttcag 3240 
tcaaattcag cctgcaacca aggccaggtt 3300 
gcccgtcagg cgaagcaggt cttgtgcaga 3360 
aatcacgcac ggcaatgcga ttcccagccc 3420 
ctccaagccc aacggcccta ccccatctcg 3480 
cagctccgca acgccgccgg aaatggtcgc 3540 
atcgccgcta ctcaacggga cccgaatacc 3600 
gcgctgcgct gctgtggcgg gcggcgcggc 3660 
gtccgcggac tgcgttgtgg tgggcggagg 3720 
ggccacgcgg cacggcgtcg gggacgtgct 3780 
caacattacc accgtcgagc gccccgagga 3840 

3848 



<210> 15 
<211> 1826 
<212> DNA 

<213> Gossypium hirsutum 

<220> 

<221> CDS 

<222> (31) (1647) 

<223> cotton protox-1 coding sequence 

<400> 15 

cctctcgctc gcctggcccc accaccaatc atg acg get eta ate gac ctt tet 54 

Met Thr Ala Leu lie Asp Leu Ser 
1 5 

ctt etc cgt tec teg ccc tec gtt tec cet tte tec ata ccc cae cac 102 
Leu Leu Arg Ser Ser Pro Ser Val Ser Pro Phe Ser lie Pro His His 
10 15 20 

cag cat ccg ccc cgc ttt cgt aaa cct tte aag etc ega tgc tec etc 150 
Gin His Pro Pro Arg Phe Arg Lys Pro Phe Lys Leu Arg Cys Ser Leu 
25 '30 35 * 40 

gee gag ggt ccc acg att tec tea tet aaa ate gac ggg gga gaa tea 198 
Ala Glu Gly Pro Ifhx He Ser Ser Ser Lys He Asp Gly Gly Glu Ser 
45 50 55 

tec ate geg gat tgc gtc ate gtt gga ggt ggt ate agt gga ctt tgc 246 
Ser He Ala Asp Cys Val He Val Gly Gly Gly He Ser Gly Leu Cys 
60 65 7Q 

att get caa get etc gee ace aag cac cgt gac gtc get tec aat gtg 294 
He Ala Gin Ala Leu Ala Thr Lys His Arg Asp Val Ala Ser Asn Val 
75 80 85 

att gtg acg gag gcc aga gac cgt gtt ggt ggc aac ate act acc gtt 342 
He Val Thr Glu Ala Arg Asp Arg Val Gly Gly Asn He Thr Thr Val 
90 95 100 

gag aga gat gga tat ctg tgg gaa gaa ggc ccc aac agt ttt cag ccc 390 
Glu Arg Asp Gly Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro 
105 110 . 115 120 
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tec gat cct att eta aee atg gcc gtg gat agt gga ttg aag gac gat 438 
Ser Asp Pro He Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Asp 
125 130 135 

ttg gtt tta ggt gac cct aat gca ccg cga ttt gta eta tgg gag gga 486 
Leu Val Leu Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu Gly 
140 145 150 

aaa eta agg cct gtg cec tec aag cca acc gac ttg ccg ttt ttt gat 534 
Lys Leu Arg Pro Val Pro Ser Lys Pro Thr Asp Leu Pro Phe Phe Asp 
155 160 165 

ttg atg age att get gga aaa ctt agg get ggg ttc ggg get att ggc 582 
Leu Met Ser He Ala Gly Lys Leu Arg Ala Gly Phe Gly Ala He Gly 
170 175 180 

att egg cct cec cct ccg ggt tat gaa gaa teg gtg gag gag ttt gtg 630 
He Arg Pro Pro Pro Pro Gly Tyr Glu Glu Ser Val Glu Glu Phe Val 
185 190 195 200 

cgc cgt aat ctt ggt get gag gtt ttt gaa cgc ttt att gaa cca ttt 678 
Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Phe He Glu Pro Phe 
205 210 215 

tgt tea ggt gtt tat gca ggg gat cct tea aaa tta age atg aaa gca 726 
Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys Ala 
220 225 230 

gca ttt gga aga gta tgg aag eta gaa gag att ggt ggc age ate att 774 
Ala Phe Gly Arg Val Trp Lys Leu Glu Glu He Gly Gly Ser He He 
235 240 245 

ggt ggc act ttc aag aca ate eag gag aga aat aag aca cct aag cca 822 
Gly Gly Thr Phe Lys Thr He Gin Glu Arg Asn Lys Thr Pro Lys Pro 
250 255 260 

cec aga gac ccg cgt etg cca aaa ccg aag ggc caa aca gtt gga tct 870 
Pro Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser 
265 270 275 280 

ttt agg aag gga ctt ace atg etg cct gag gca att get aac agt ttg 918 
Phe Arg Lys Gly Leu Thr Met Leu Pro Glu Ala He Ala Asn Ser Leu 
285 290 295 

ggt age aat gta aaa tta tct tgg aag ctt tee agt att acc aaa ttg 966 
Gly Ser Asn Val Lys Leu Ser Trp Lys Leu Ser Ser He Thr Lys Leu 
300 305 310 

ggc aat gga ggg tat aac ttg aca ttt gaa aca cct gaa gga atg gta 1014 
Gly Asn Gly Gly Tyr Asn Leu Thr Phe Glu Thr Pro Glu Gly Met Val 
315 320 325 

tct ctt eag agt aga agt gtt gta atg aee att cca tec cat gtt gee 1062 
Ser Leu Gin Ser Arg Ser Val Val Met Thr He Pro Ser His Val Ala 
330 335 340 
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agt aac ttg ttg cat cct etc teg get get get gca gat gca tta tec 1110 
Ser Asn Leu Leu His Pro Leu Ser Ala Ala Ala Ala Asp Ala Leu Ser 
345 350 355 360 

caa ttt tat tat cet cca gtt gea tea gtc aca gtc tec tat cca aaa 1158 
Gin Phe Tyr Tyr Pro Pro Val Ala Ser Val Thr Val Ser Tyr Pro Lys 
365 370 375 

gaa gcc att cga aaa gaa tgt ttg att gat ggt gaa ctt aag ggg ttt 1206 
Glu Ala lie Arg Lys Glu Cys Leu lie Asp Gly Glu Leu Lys Gly Phe 
380 385 390 

ggc cag ttg cac cca cgc age caa gga att gaa act tta ggg acg ata 1254 
Gly Gin Leu His Pro Arg Ser Gin Gly lie Glu Thr Leu Gly Thr lie 
395 400 405 

tac agt tea tea ctt tte cec aat cga get cca tct ggc agg gtg ttg 1302 
Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Ser Gly Arg Val Leu 
410 415 420 

etc ttg aac tac ata gga gga get ace aac act gga att ttg tec aag 1350 
Leu Leu Asn Tyr lie Gly Gly Ala Thr Asn Thr Gly He Leu Ser Lys 
425 430 435 440 

act gaa ggg gaa ctt gta gaa gca gtt gat cgt gat ttg aga aaa atg 1398 
Thr Glu Gly Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys Met 
445 450 455 

ctt ata aat cct aat gca aag gat cct ctt gtt ttg ggt gta aga gta 1446 
Leu He Asn Pro Asn Ala Lys Asp Pro Leu Val Leu Gly Val Arg Val 
460 465 470 

tgg cca aaa gee att cca cag tte ttg gtt ggt cat ttg gat etc ctt 1494 
Trp Pro Lys Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu Leu 
475 480 485 

gat agt gca aaa atg get etc agg gat tct ggg ttt eat gga ctg ttt 1542 
Asp Ser Ala Lys Met Ala Leu Arg Asp Ser Gly Phe His Gly Leu Phe 
490 495 500 

ctt ggg ggc aac tat gta tct ggt gtg gca tta gga egg tgt gtg gaa 1590 
Leu Gly Gly Asn Tyr Val Ser Gly Val Ala Leu Gly Arg Cys Val Glu 
505 510 515 520 

ggt get tac gag gtt gca get gaa gtg aag gaa tte ctg tea caa tat 1638 
Gly Ala Tyr Glu Val Ala Ala Glu Val Lys Glu Phe Leu Ser Gin Tyr 
525 530 535 

gea tac aaa taatattgaa attcttgtca ggctgcaaat gtagaagtca 1687 

Ala Tyr Lys 

gttattggat agtatctett tagctaaaaa attgggtagg gttttttttg ttagttcctt 1747 
gaccactttt tggggttttc attagaactt catatttgta tatcatgttg caatatcaaa 1807 
aaaaaaaaaa aaaaaaaaa 1826 
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<210> 16 
<211> 539 
<212> PRT 

<213> Gossypium hirsutum 
<400> 16 

Met Thr Ala Leu lie Asp Leu Ser Leu Leu Arg Ser Ser Pro Ser Val 
15 10 15 

Ser Pro Phe Ser lie Pro His His Gin His Pro Pro Arg Phe Arg Lys 
20 25 30 

Pro Phe Lys Leu Arg Cys Ser Leu Ala Glu Gly Pro Thr lie Ser Ser 
35 40 45 

Ser Lys lie Asp Gly Gly Glu Ser Ser lie Ala Asp Cys Val He Val 
50 55 60 

Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu Ala Thr Lys 
65 70 75 80 

His Arg Asp Val Ala Ser Asn Val He Val Thr Glu Ala Arg Asp Arg 
85 90 95 

Val Gly Gly Asn He Thr Thr Val Glu Arg Asp Gly Tyr Leu Trp Glu 
100 105 110 

Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Pro He Leu Thr Met Ala 
115 120 125 

Val Asp Ser Gly Leu Lys Asp Asp Leu Val Leu Gly Asp Pro Asn Ala 
130 135 140 

Pro Arg Phe Val Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys 
145 150 155 160 

Pro Thr Asp Leu Pro Phe Phe Asp Leu Met Ser He Ala Gly Lys Leu 
165 170 175 

Arg Ala Gly Phe Gly Ala He Gly He Arg Pro Pro Pro Pro Gly Tyr 
180 185 190 

Glu Glu Ser Val Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val 
195 200 205 

Phe Glu Arg Phe He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp 
210 215 220 

Pro Ser Lys Leu Ser Met Lys Ala Ala Phe Gly Arg Val Trp Lys Leu 
225 230 235 240 

Glu Glu He Gly Gly Ser He He Gly Gly Thr Phe Lys Thr He Gin 
245 250 255 

Glu Arg Asn Lys Thr Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Lys 
260 265 270 
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Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys Gly Leu Thr Met Leu 
275 280 285 

Pro Glu Ala lie Ala Asn Ser Leu Gly Ser Asn Val Lys Leu Ser Trp 
290 295 300 

Lys Leu Ser Ser He Thr Lys Leu Gly Asn Gly Gly Tyr Asn Leu Thr 
305 310 315 320 

Phe Glu Thr Pro Glu Gly Met Val Ser Leu Gin Ser Arg Ser Val Val 
325 330 335 

Met Thr He Pro Ser His Val Ala Ser Asn Leu Leu His Pro Leu Ser 
340 345 350 

Ala Ala Ala Ala Asp Ala Leu Ser Gin Phe Tyr Tyr Pro Pro Val Ala 

355 360 365 

Ser Val Thr Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu 
370 375 380 

He Asp Gly Glu Leu Lys Gly Phe Gly Gin Leu His Pro Arg Ser Gin 
385 390 395 400 

Gly He Glu Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn 
405 410 415 

Arg Ala Pro Ser Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala 
420 425 430 

Thr Asn Thr Gly He Leu Ser Lys Thr Glu Gly Glu Leu Val Glu Ala 
435 440 445 

Val Asp Arg Asp Leu Arg Lys Met Leu He Asn Pro Asn Ala Lys Asp 
450 455 460 

Pro Leu Val Leu Gly Val Arg Val Trp Pro Lys Ala He Pro Gin Phe 
465 470 475 480 

Leu Val Gly His Leu Asp Leu Leu Asp Ser Ala Lys Met Ala Leu Arg 
485 490 495 

Asp Ser Gly Phe His Gly Leu Phe Leu Gly Gly Asn Tyr Val Ser Gly 
500 505 510 

Val Ala Leu Gly Arg Cys Val Glu Gly Ala Tyr Glu Val Ala Ala Glu 
515 520 525 



Val Lys Glu Phe Leu Ser Gin Tyr Ala Tyr Lys 
530 535 



<210> 17 
<211> 1910 
<212> DNA 
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<213> Beta vulgaris 

<220> 

<221> CDS 

<222> (1)..(1680) 

<223> sugar beet protox--l coding sequence 



<400> 17 

atg aaa tea atg gcg tta tea aae tgc att cca cag aca eag tge atg 48 
Met Lys Ser Met Ala Leu Ser Asn Cys lie Pro Gin Thr Gin Cys Met 
15 10 15 

cca ttg cgc age age ggg cat tac agg ggt aat tgt ate atg ttg tea 96 
Pro Leu Arg Ser Ser Gly His Tyr Arg Gly Asn Cys He Met Leu Ser 
20 25 30 

att cca tgt agt tta att gga aga cga ggt tat tat tea cat aag aag 144 
He Pro Cys Ser Leu He Gly Arg Arg Gly Tyr Tyr Ser His Lys Lys 
35 40 45 

agg agg atg age atg agt tgc age aca age tea ggc tea aag tea gcg 192 
Arg Arg Met Ser Met Ser Cys Ser Thr Ser Ser Gly Ser Lys Ser Ala 
50 55 60 

gtt aaa gaa gea gga tea gga tea ggt gea gga gga ttg eta gae tgc 240 
Val Lys Glu Ala Gly Ser Gly Ser Gly Ala Gly Gly Leu Leu Asp Cys 
65 70 75 80 

gta ate gtt gga ggt gga att age ggg ctt tgc ate gcg cag get ett 288 
Val He Val Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu 
85 90 95 

tgt aca aaa cac tec tct tec tct tta tec cca aat ttt ata gtt aca 336 
Cys Thr Lys His Ser Ser Ser Ser Leu Ser Pro Asn Phe He Val Thr 
100 105 110 

gag gee aaa gae aga gtt ggc ggc aae ate gtc act gtg gag gcc gat 384 
Glu Ala Lys Asp Arg Val Gly Gly Asn He Val Thr Val Glu Ala Asp 
115 120 125 

ggc tat ate tgg gag gag gga cec aat age ttc eag cet tee gae gcg 432 
Gly Tyr He Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Ala 
130 135 140 

gtg etc acc atg gcg gtc gae agt ggc ttg aaa gat gag ttg gtg etc 480 
Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu 
145 150 155 160 

gga gat ccc aat get cct cgc ttt gtg eta tgg aat gae aaa tta agg 528 
Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Asn Asp Lys Leu Arg 
165 170 175 

ccc gta cct tec agt etc acc gae etc cct ttc ttc gae etc atg acc 576 
Pro Val Pro Ser Ser Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Thr 
180 185 190 

att ecg ggc aag att agg get get ctt ggt get etc gga ttt cgc cct 624 
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lie Pro Gly Lys lie Arg Ala Ala Leu Gly Ala Leu Gly Phe Arg Pro 
195 200 205 

tct cct cca cct cat gag gaa tct gtt gaa cac ttt gtg cgt cgt aat 672 
Ser Pro Pro Pro His Glu Glu Ser Val Glu His Phe Val Arg Arg Asn 
210 215 220 

etc gga gat gag gtc ttt gaa cgc ttg att gaa ccc ttt tgt tea ggt 720 
Leu Gly Asp Glu Val Phe Glu Arg Leu lie Glu Pro Phe Cys Ser Gly 
225 230 235 240 

gtg tat gcc ggt gat cct gcc aag ctg agt atg aaa get get ttt ggg 768 
Val Tyr Ala Gly Asp Pro Ala Lys Leu Ser Met Lys Ala Ala Phe Gly 
245 250 255 

aag gtc tgg aag ttg gag caa aag ggt ggc age ata att ggt ggc act 816 
Lys Val Trp Lys Leu Glu Gin Lys Gly Gly Ser lie lie Gly Gly Thr 
260 265 270 

etc aaa get ata cag gaa aga ggg agt aat cct aag ecg ccc cgt gac 864 
Leu Lys Ala lie Gin Glu Arg Gly Ser Asn Pro Lys Pro Pro Arg Asp 
275 280 285 

cag cgc etc cct aaa* cca aag ggt cag act gtt gga tec ttt aga aag 912 
Gin Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys 
290 295 300 

gga etc gtt atg ttg cct ace gee att tct get ega ett ggc agt aga 960 
Gly Leu Val Met Leu Pro Thr Ala lie Ser Ala Arg Leu Gly Ser Arg 
305 310 315 320 

gtg aaa eta tct tgg acc ett tct agt ate gta aag tea etc aat gga 1008 
Val Lys Leu Ser Trp Thr Leu Ser Ser He Val Lys Ser Leu Asn Gly 
325 330 335 

gaa tat agt ctg act tat gat ace cca gat ggc ttg gtt tct gta aga 1056 
Glu Tyr Ser Leu Thr Tyr Asp Thr Pro Asp Gly Leu Val Ser Val Arg 
340 345 350 

acc aaa agt gtt gtg atg act gtt cca tea tat gtt gca agt agg ett 1104 
Thr Lys Ser Val Val Met Thr Val Pro Ser Tyr Val Ala Ser Arg Leu 
355 360 365 

ett cgt cca ett tea gac tct get gca gat tct ett tea aaa ttt tae 1152 
Leu Arg Pro Leu Ser Asp Ser Ala Ala Asp Ser Leu Ser Lys Phe Tyr 
370 375 380 

tat cca cca gtt gca gca gtg tea ett tec tat cct aaa gaa gcg ate 1200 
Tyr Pro Pro Val Ala Ala Val Ser Leu Ser Tyr Pro Lys Glu Ala He 
385 390 395 400 

aga tea gaa tgc ttg att aat ggt gaa ett caa ggt ttc ggg caa eta 1248 
Arg Ser Glu Cys Leu He Asn Gly Glu Leu Gin Gly Phe Gly Gin Leu 
405 410 415 

cat ccc cgc agt cag ggt gtg gaa ace ttg gga aca att tat agt teg 1296 
His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser 
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420 425 430 

tct ctt ttc cct ggt cga gca cca cct ggt agg ate ttg ate ttg age 1344 
Sex Leu Phe Pro Gly Arg Ala Pro Pro Gly Arg lie Leu lie Leu Ser 
435 440 445 

tac ate gga ggt get aaa aat cct ggc ata tta aac aag teg aaa gat 1392 
Tyr lie Gly Gly Ala Lys Asn Pro Gly lie Leu Asn Lys Ser Lys Asp 
450 455 460 

gaa ctt gee aag aea gtt gac aag gae etg aga aga atg ctt ata aat 1440 
Glu Leu Ala Lys Thr Val Asp Lys Asp Leu Arg Arg Met Leu lie Asn 
465 470 475 480 

cct gat gca aaa ctt cct cgt gta ctg ggt gtg aga gta tgg cct caa 1488 
Pro Asp Ala Lys Leu Pro Arg Val Leu Gly Val Arg Val Trp Pro Gin 
485 490 495 

gca ata ece cag ttt tct att ggg cac ttt gat etg etc gat get gca 1536 
Ala He Pro Gin Phe Ser He Gly His Phe Asp Leu Leu Asp Ala Ala 
500 505 510 

aaa get get ctg aea gat aca ggg gtc aaa gga ctg ttt ctt ggt ggc 1584 
Lys Ala Ala Leu Thr Asp Thr Gly Val Lys Gly Leu Phe Leu Gly Gly 
515 520 525 

aac tat gtt tea ggt gtt gee ttg ggg egg tgt ata gag ggt get tat 1632 
Asn Tyr Val Ser Gly Val Ala Leu Gly Arg Cys He Glu Gly Ala Tyr 
530 535 540 

gag tct gca get gag gta gta gat ttc etc tea cag tac tea gae aaa 1680 
Glu Ser Ala Ala Glu Val Val Asp Phe Leu Ser Gin Tyr Ser Asp Lys 
545 550 555 560 

tagagcttca gcatcctgtg taattcaaca caggectttt tgtatctgtt gtgcgcgcat 1740 

gtagtctggt cgtggtgcta ggattgatta gttgctctgc tgtgtgatcc acaagaattt 1800 

tgatggaatt tttccagatg tgggcattat atgttgctgt cttataaatc cttaatttgt 1860 

acgtttagtg aattacaccg catttgatga etaaaaaaaa aaaaaaaaaa 1910 



<210> 18 
<211> 560 
<212> PRT 

<213> Beta vulgaris 



<400> 18 

Met Lys Ser Met Ala Leu Ser Asn Cys He Pro Gin Thr Gin Cys Met 
15 10 15 

Pro Leu Arg Ser Ser Gly His Tyr Arg Gly Asn Cys He Met Leu Ser 
20 25 30 

He Pro Cys Ser Leu He Gly Arg Arg Gly Tyz Tyr Ser His Lys Lys 
35 40 45 
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Arg Arg Met Ser Met Ser Cys Ser Thr Ser Ser Gly Ser Lys Ser Ala 
50 55 60 

Val Lys Glu Ala Gly Ser Gly Ser Gly Ala Gly Gly Leu Leu Asp Cys 
65 70 75 80 

Val lie Val Gly Gly Gly He Ser Gly Leu Cys He Ala Gin Ala Leu 
85 90 95 

Cys Thr Lys His Ser Ser Ser Ser Leu Ser Pro Asn Phe He Val Thr 
100 105 110 

Glu Ala Lys Asp Arg Val Gly Gly Asn He Val Thr Val Glu Ala Asp 
115 120 125 

Gly Tyr He Trp Glu Glu Gly Pro Asn Ser Phe Gin Pro Ser Asp Ala 
130 135 140 

Val Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp Glu Leu Val Leu 
145 150 155 160 

Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Asn Asp Lys Leu Arg 
165 170 175 

Pro Val Pro Ser Ser Leu Thr Asp Leu Pro Phe Phe Asp Leu Met Thr 
180 185 190 

He Pro Gly Lys He Arg Ala Ala Leu Gly Ala Leu Gly Phe Arg Pro 
195 200 205 

Ser Pro Pro Pro His Glu Glu Ser Val Glu His Phe Val Arg Arg Asn 
210 215 220 

Leu Gly Asp Glu Val Phe Glu Arg Leu He Glu Pro Phe Cys Ser Gly 
225 230 235 240 

Val Tyr Ala Gly Asp Pro Ala Lys Leu Ser Met Lys Ala Ala Phe Gly 
245 250 255 

Lys Val Trp Lys Leu Glu Gin Lys Gly Gly Ser He He Gly Gly Thr 
260 265 270 

Leu Lys Ala He Gin Glu Arg Gly Ser Asn Pro Lys Pro Pro Arg Asp 
275 280 285 

Gin Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Gly Ser Phe Arg Lys 
290 295 300 

Gly Leu Val Met Leu Pro Thr Ala He Ser Ala Arg Leu Gly Ser Arg 
305 310 315 320 

Val Lys Leu Ser Trp Thr Leu Ser Ser He Val Lys Ser Leu Asn Gly 
325 330 335 



Glu Tyr Ser Leu Thr Tyr Asp Thr Pro Asp Gly Leu Val Ser Val Arg 
340 . 345 350 
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Thr Lys Ser Val Val Met Thr Val Pro Ser Tyr Val Ala Ser Arg Leu 
355 360 365 

Leu Arg Pro Leu Ser Asp Ser Ala Ala Asp Ser Leu Ser Lys Phe Tyr 
370 375 380 

Tyr Pro Pro Val Ala Ala Val Ser Leu Ser Tyr Pro Lys Glu Ala lie 
385 390 395 400 

Arg Ser Glu Cys Leu He Asn Gly Glu Leu Gin Gly Phe Gly Gin Leu 
405 410 415 

His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr He Tyr Ser Ser 
420 425 430 

Ser Leu Phe Pro Gly Arg Ala Pro Pro Gly Arg He Leu He Leu Ser 

435 440 445 

Tyr He Gly Gly Ala Lys Asn Pro Gly He Leu Asn Lys Ser Lys Asp 
450 455 460 

Glu Leu Ala Lys Thr Val Asp Lys Asp Leu Arg Arg Met Leu He Asn 
465 470 475 480 

Pro Asp Ala Lys Leu Pro Arg Val Leu Gly Val Arg Val Trp Pro Gin 
485 490 495 

Ala He Pro Gin Phe Ser He Gly His Phe Asp Leu Leu Asp Ala Ala 
500 505 510 

Lys Ala Ala Leu Thr Asp Thr Gly Val Lys Gly Leu Phe Leu Gly Gly 
515 520 525 

Asn Tyr Val Ser Gly Val Ala Leu Gly Arg Cys He Glu Gly Ala Tyr 
530 535 540 

Glu Ser Ala Ala Glu Val Val Asp Phe Leu Ser Gin Tyr Ser Asp Lys 
545 550 555 560 



<210> 19 
<211> 1784 
<212> DNA 

<213> Brassica napus 

<220> 
<221> CDS 

<222> (47).. (1654) 

<223> oilseed rape protox-1 coding sequence 
<400> 19 

gggccccccc caaaattgag gattctcctt ctcgcgggcg atcgcc atg gat tta 55 

Met Asp Leu 
1 
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tct ctt etc cgt ccg cag cca ttc eta teg cca tte tea aat eca ttt 103 
Ser Leu Leu Arg Pro Gin Pro Phe Leu Ser Pro Phe Ser Asn Pro Phe 
5 10 15 

act egg teg cgt ccc tac aag cct etc aac etc cgt tgc tee gta tee 151 
Pro Arg Ser Arg Pro Tyr Lys Pro Leu .Asn Leu Arg Cys Ser Val Ser 
20 25 30 35 

ggt gga tec gtc gte ggc tet tct aca ate gaa gge gga gga gga ggt 199 
Gly Gly Ser Val Val Gly Ser Ser Thr He Glu Gly Gly Gly Gly Gly 
40 45 50 

I 

aaa acc gtc acg gcg gac tgc gtg ate gte ggc gga gga ate age ggc 247 
Lys Thr Val Thr Ala Asp Cys Val He Val Gly Gly Gly He Ser Gly 
55 60 65 

ctg tgc att gcg caa gcg etc gtg acg aag cac cca gac get gca aag 295 
Leu Cys He Ala Gin Ala Leu Val Thr Lys His Pro Asp Ala Ala Lys 
70 75 80 

aat gtg atg gtg acg gag gcg aag gac cgt gtg gga ggg aat ate ate 343 
Asn Val Met Val Thr Glu Ala Lys Asp T^g Val Gly Gly Asn He He 
85 90 95 

acg cga gag gag caa ggg ttt eta tgg gaa gaa ggt ccc aat age ttt 391 
Thr Arg Glu Glu Gin Gly Phe Leu Trp Glu Glu Gly Pro Asn Ser Phe 
100 105 110 115 

cag ccg tct gat cct atg etc act atg gtg gta gat agt ggt ttg aaa 439 
Gin Pro Ser Asp Pro Met Leu Thr Met Val Val Asp Ser Gly Leu Lys 
120 125 130 

gat gat eta gtc ttg gga gat cct act get ccg agg ttt gtg ttg tgg 487 
Asp Asp Leu Val Leu Gly Asp Pro Thr Ala Pro Arg Phe Val Leu Trp 
135 140 145 

aat ggg aag ctg agg ccg gtt ccg teg aag eta act gac ttg cct ttc 535 
Asn Gly Lys Leu Arg Pro Val Pro Ser Lys Leu Thr Asp Leu Pro Phe 
150 155 160 

ttt gac ttg atg agt att gga ggg aag att aga get ggg ttt ggt gee 583 
Phe Asp Leu Met Ser He Gly Gly Lys He Arg Ala Gly Phe Gly Ala 
165 170 175 

att ggt att cga cct tea cct ccg ggt cgt gag gaa tea gtg gaa gag 631 
He Gly He Arg Pro Ser Pro Pro Gly Arg Glu Glu Ser Val Glu Glu 
180 185 190 195 

ttt gta agg cgt aat ctt ggt gat gag gtt ttt gag cgc ttg att gaa 679 
Phe Val Arg Arg Asn Leu Gly Asp Glu Val Phe Glu Arg Leu He Glu 
200 205 210 

ccc ttt tgc tea ggt gtt tat gcg gga gat cct gcg aaa ctg agt atg 727 
Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ala Lys Leu Ser Met 
215 220 225 

aaa gca get ttt ggg aag gtt tgg aag eta gag gag aat ggt ggg age 775 
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Lys Ala Ala Phe Gly Lys Val Trp Lys Leu Glu Glu Asn Gly Gly Ser 
230 235 240 

ate att ggt ggt get ttt aag gca att caa gcg aaa aat aaa get ece 823 
lie He Gly Gly Ala Phe Lys Ala He Gin Ala Lys Asn Lys Ala Pro 
245 250 255 

aag aca ace cga gat ccg cgt ctg cca aag cca aag ggc caa act gtt 871 
Lys Thr Thr Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly Gin Thr Val 
260 265 270 275 

ggt tot ttc agg aaa gga etc aca atg ctg cca gag gca ate tec gca 919 
Gly Ser Phe Arg Lys Gly Leu Thr Met Leu Pro Glu Ala He Ser Ala 
280 285 290 

agg ttg ggt gac aag gtg aaa gtt tet tgg aag etc tea agt ate act 967 
Arg Leu Gly Asp Lys Val Lys Val Ser Trp Lys Leu Ser Ser He Thr 
295 300 305 

aag ctg gee age gga gaa tat age tta act tac gaa act ccg gag ggt 1015 
Lys Leu Alia Ser Gly Glu Tyr Ser Leu Thr Tyr Glu Thr Pro Glu Gly 
310 • 315 320. 

ata gtc act gta cag age aaa agt gta gtg atg act gtg cca tet cat 1063 
He Val Thr Val Gin Ser Lys Ser Val Val Met Thr Val Pro Ser His 
325 330 335 

gtt get agt agt etc ttg cgc ect etc tet gat Z2t gca get gaa gcg 1111 
Val Ala Ser Ser Leu Leu Arg Pro Leu Ser Asp Ser Ala Ala Glu Ala 
340 345 350 , 355 

etc tea aaa etc tac tat ccg cca gtt gca gee gta tec ate tea tac 1159 
Leu Ser Lys Leu Tyr Tyr Pro Pro Val Ala Ala Val Ser He Ser Tyr 
360 365 370 

gcg aaa ga^ gca ate cga age gaa tge tta ata gat ggt gaa eta aaa 1207 
Ala Lys Glu Ala He Arg Ser Glu Cys Leu He Asp Gly Glu Leu Lys 
375 380 385 

ggg ttc ggc cag ttg cat cca cgc aeg caa aaa gtg gaa act ctt gga 1255 
Gly Phe Gly Gin Leu His Pro Arg Thr Gin Lys Val Glu Thr Leu Gly 
390 395 400 

aca ata tac agt tea teg etc ttt ece aac cga gca ccg cet gga aga 1303 
Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Pro Gly TVrg 
405 410 415 

gta ttg eta ttg aac tac ate ggt gga get ace aac act ggg ate tta 1351 
Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Leu 
420 425 430 , 435 

tea aag teg gaa ggt gag tta gtg gaa gca gta gat aga gac ttg agg 1399 
Ser Lys Ser Glu Gly Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg 
440 445 450 

aag atg ctg ata aag cca age teg ace gat cca ctt gta ctt gga gta 1447 
Lys Met Leu He Lys Pro Ser Ser Thr Asp Pro Leu Val Leu Gly Val 
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455 460 465 

aaa tta tgg cct caa gcc att cct cag ttt ctg ata ggt cac att gat 1495 
Lys Leu Trp Pro Gin Ala lie Pro Gin Phe Leu lie Gly His lie Asp 
470 475 480 

ttg gta gac gca gcg aaa gca teg etc teg tea tet ggt cat gag ggc 1543 
Leu Val Asp Ala Ala Lys Ala Ser Leu Ser Ser Ser Gly His Glu Gly 
485 490 495 

tta ttc ttg ggt gga aat tac gtt gcc ggt gta gca ttg ggt egg tgt 1591 
Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys 
500 505 510 515 

gtg gaa ggt get tat gaa act gca acc caa gtg aat gat ttc atg tea 1639 
Val Glu Gly Ala Tyr Glu Thr Al^ Thr Gin Val Asn Asp Phe Met Ser 
520 525 530 

agg tat get tac aag taatgtaacg cagcaacgat ttgatactaa gtagtagatt 1694 
Arg Tyr Ala Tyr Lys 
535 

ttgcagtttt gactttaaga acactctgtt tgtgaaaaat tcaagtctgt gattgagtaa 1754 
atttatgtat tattactaaa aaaaaaaaaa 1784 



, <210> 20 
<211> 536 
<212> PRT 

<213> Brassica napus 
<400> 20 

Met Asp Leu Ser Leu Leu Arg Pro Gin Pro Phe Leu Ser Pro Phe Ser 
15 10 15 

Asn Pro Phe Pro Arg Ser Arg Pro Tyr Lys . Pro Leu Asn Leu Arg Cys 
20 25 30 

Ser Val Ser Gly Gly Ser Val Val Gly Ser Ser Thr He Glu Gly Glv 
35 40 45 

Gly Gly Gly Lys Thr Val Thr Ala Asp Cys Val He Val Gly Gly Glv 
50 55 60 

He Ser Gly Leu Cys He Ala Gin Ala Leu Val Thr Lys His Pro Asp 
^5 70 75 80 

Ala Ala Lys Asn Val Met Val Thr Glu Ala Lys Asp Arg Val Gly Gly 
85 90 95 

Asn He He Thr Arg Glu Glu Gin Gly Phe Leu Trp Glu Glu Gly Pro 
100 105 no 

Asn Ser Phe Gin Pro Ser Asp Pro Met Leu Thr Met Val Val Asp Ser 
115 120 125 
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Gly Leu Lys Asp Asp heu Val Leu Gly Asp Pro Thr Ala Pro Arg Phe 
130 135 140 

Val Leu Trp Asn Gly Lys Leu Arg Pro Val Pro Ser Lys Leu Thr Asp 
145 150 155 160 

Leu Pro Phe Phe Asp Leu Met Ser He Gly Gly Lys He Arg Ala Gly 
165 170 175 

Phe Gly Ala He Gly He Arg Pro Ser Pro Pro Gly Arg Glu Glu Ser 
180 185 190 

Val Glu Glu Phe Val Arg Arg Asn Leu Gly Asp Glu Val Phe Glu Arg 
195 200 205 

Leu He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ala Lys 
210 215 220 

Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Lys Leu Glu Glu Asn 
225 230 235 240 

Gly Gly Ser He He Gly Gly Ala Phe Lys Ala He Gin Ala Lys Asn 
245 250 255 

Lys Ala Pro Lys Thr Thr Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly 
260 265 270 

Gin Thr Val Gly Ser Phe Arg Lys Gly Leu Thr Met Leu Pro Glu Ala 
275 280 285 

He Ser Ala Arg Leu Gly Asp Lys Val Lys Val Ser Trp Lys Leu Ser 
290 295 300 

Ser He Thr Lys Leu Ala Ser Gly Glu Tyr Ser Leu Thr Tyr Glu Thr 
305 310 315 320 

Pro Glu Gly He Val Thr Val Gin Ser Lys Ser Val Val Met Thr Val 
325 330 335 

Pro Ser His Val Ala Ser Ser Leu Leu Arg Pro Leu Ser Asp Ser Ala 

340 . 345 350 

Ala Glu Ala Leu Ser Lys Leu Tyr Tyr Pro Pro Val Ala Ala Val Ser 
355 360 365 

He Ser Tyr Ala Lys Glu Ala He Arg Ser Glu Cys Leu He Asp Gly 
370 375 380 

Glu Leu Lys Gly Phe Gly Gin Leu His Pro Arg Thr Gin Lys Val Glu 
385 390 395 400 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 
405 410 415 



Pro Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr 
420 425 430 
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Gly lie Leu Ser Lys Ser Glu Gly Glu Leu Val Glu Ala Val Asp Arg 
435 440 445 

Asp Leu Arg Lys Met Leu lie Lys Pro Ser Ser Thr Asp Pro Leu Val 
450 455 460 

Leu Gly Val Lys Leu Trp Pro Gin Ala lie Pro Gin Phe Leu lie Gly 
465 470 475 480 

His He Asp Leu Val Asp Ala Ala Lys Ala Ser Leu Ser Ser Ser Gly 
485 490 495 

His Glu Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 
500 505 510 

Gly Arg Cys Val Glu Gly Ala Tyr Glu Thr Ala Thr Gin Val Asn Asp 
515 520 525 



Phe Met Ser Arg Tyr Ala Tyr Lys 
530 535 



<210> 21 
<211> 1224 
<212> DNA 

<213> Oryza sativa 

<220> 

<221> CDS 

<222> (1) . . (936) 

<223> rice protox-1 partial coding sequence 
<400> 21 

egg get ttg aag get gea ttt ggg aag gtg tgg agg ctg gag gat act 48 
Arg Ala Leu Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Asp Thr 
15 10 15 

gga ggt age att att ggt gga acc ate aag aca ate cag gag agg ggg 96 
Gly Gly Ser He He Gly Gly Thr He Lys Thr He Gin Glu Arg Gly 
20 25 30 

aaa aac cee aaa ccg ceg agg gat cec ege ett eca acg eca aag ggg 144 
Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Thr Pro Lys Gly 
35 40 45 

eag aca gtt gea tct ttc agg aag ggt ctg act atg etc ccg gat get 192 
Gin Thr Val Ala Ser Phe Arg Lys Gly Leu Thr Met Leu Pro Asp Ala 
50 55 60 

att aca tct agg ttg ggt age aaa gtc aaa ett tea tgg aag ttg aca 240 
He Thr Ser Arg Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr 
65 70 75 80 

age att aca aag tea gae aac aaa gga tat gea tta gtg tat gaa aca 288 
Ser He Thr Lys Ser Asp Asn Lys Gly Tyr Ala Leu Val Tyr Glu Thr 
85 90 95 
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cca gaa ggg gtg gtc teg gtg caa get aaa act gtt gte atg acc ate 336 
Pro Glu Gly Val Val Ser Val Gin Ala Lys Thr Val Val Met Thr He 
100 105 110 

cca tea tat gtt get agt gat ate ttg egg cca ctt tea agt gat gca 384 
Pro Ser Tyr Val Ala Ser Asp He Leu Arg Pro Leu Ser Ser Asp Ala 
115 120 125 

gca gat get ctg tea ata ttc tat tat cca cca gtt get get gta act 432 
Ala Asp Ala Leu Ser He Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 
130 135 140 

gtt tea tat cca aaa gaa gca att aga aaa gaa tgc tta att gac gga 480 
Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
145 150 155 160 

gag etc cag ggt ttc ggc cag ctg cat ccg cgt agt cag gga gtt gag 528 
Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 
165 170 175 

act tta gga aca ata tat age tea tea etc ttt cca aat cgt get cca 576 
Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 
180 185 ■ 190 

get gga agg gtg tta ctt ctg aac tac ata gga ggt tct aca aat aca 624 
Ala Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ser Thr Asn Thr 
195 200 205 

ggg att gtt tec aag act gaa agt gag ctg gta gaa gca gtt gac cgt 672 
Gly He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 
210 215 220 

gac etc agg aag atg ctg ata aat cet aga gca gtg gac ect ttg gtc 720 
Asp Leu Arg Lys Met Leu He Asn Pro Arg Ala Val Asp Pro Leu Val 
225 230 235 240 

ctt ggc gtc egg gta tgg cca caa gee ata cca cag ttc etc att ggc 768 
Leu Gly Val Arg Val Trp Pro Gin Ala He Pro Gin Phe Leu He Gly 
245 250 255 

cat ctt gat cat ctt gag get gca aaa tct gee ctg ggc aaa ggt ggg 816 
His Leu Asp His Leu Glu Ala Ala Lys Ser Ala Leu Gly Lys Gly Gly 
260 265 270 

tat gat gga ttg ttc etc gga ggg aac tat gtt gca gga gtt gee ctg 864 
Tyr Asp Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 
275 280 285 

ggc cga tgc gtt gaa ggt gca tat gag agt gee tea caa ata tct gac 912 
Gly Arg Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Ser Asp 
290 295 300 

tac ttg acc aag tac gcc tac aag tgatcaaagt tggcctgctc ettttggcac 966 
Tyr Leu Thr Lys Tyr Ala Tyr Lys 
305 310 
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atagatgtga ggcttctagc agcaaaaatt tcatgggcat ctttttatcc tgattctaat 1026 

tagttagaat ttagaattgt agaggaatgt tccatttgca gttcataata gttgttcaga 1086 

tttcagccat tcaatttgtg cagccattta ctatatgtag tatgatcttg taagtactac 1146 

taagaacaaa tcaattatat tttcctgcaa gtgacatctt aatcgtcagc aaatccagtt 1206 

actagtaaaa aaaaaaaa 1224 



<210> 22 
<211> 312 
<212> PRT 

<213> Oryza sativa 
<400> 22 

Arg Ala Leu Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Asp Thr 
15 10 15 

Gly Gly Ser He He Gly Gly Thr He Lys Thr He Gin Glu Arg Gly 
20 25 30 

Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Thr Pro Lys Gly 

35 40 45 

Gin Thr Val Ala Ser Phe Arg Lys Gly Leu Thr Met Leu Pro Asp Ala 
50 55 60 

He Thr Ser Arg Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr 
65 70 75 80 

Ser He Thr Lys Ser Asp Asn Lys Gly Tyr Ala Leu Val Tyr Glu Thr 
85 90 95 

Pro Glu Gly Val Val Ser Val Gin Ala Lys Thr Val Val Met Thr He 
100 105 110 

Pro Ser Tyr Val Ala Ser Asp He Leu Arg Pro Leu Ser Ser Asp Ala 
115 120 125 

Ala Asp Ala Leu Ser He Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 

130 135 140 

Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
145 150 155 160 

Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 
165 170 175 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 
180 185 190 

Ala Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ser Thr Asn Thr 
195 200 205 

Gly He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 
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210 



215 



220 



Asp Leu Arg Lys Met Leu lie Asn Pro Arg Ala Val Asp Pro Leu Val 
225 230 235 240 

Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu lie Gly 
245 250 255 

His Leu Asp His Leu Glu Ala Ala Lys Ser Ala Leu Gly Lys Gly Gly 
260 265 270 

Tyr Asp Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 
275 280 285 

Gly Arg Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin lie Ser Asp 
290 295 300 



Ty^^ Leu Thr Lys Tyr Ala Tyr Lys 
305 310 



<210> 23 
<211> 1590 
<212> DNA 

<213> Sorghum bicolor 

<220> 
<221> CDS 

<222> (1)..(1320) f 

<223> sorghum protox-1 partial coding sequence 

<400> 23 

tec acc gtc gag cgc ccc gag gaa ggg tac etc tgg gag gag ggt ccc 48 
Ser Thr Val Glu Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro 
15 10 15 

aac age ttc cag cca tec gac ccc gtt etc tec atg gee gtg gac age 96 
Asn Ser Phe Gin Pro Ser Asp Pro Val Leu Ser Met Ala Val Asp Ser 
20 25 30 

ggg ctg aag gat gac ctg gtt ttt ggg gac ccc aac gcg cca egg ttc 144 
Gly Leu Lys Asp Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe 
35 40 45 

gtg ctg tgg gag ggg aag ctg agg ccc gtg cca tec aag ccc gee gac 192 
Val Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp 
50 55 60 

etc ccg ttc ttc gat etc atg age ate eet ggc aag etc agg gee ggt 240 
Leu Pro Phe Phe Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly 
65 70 75 80 

etc ggc gcg ctt ggc ate cgc ccg eet get cca ggc cgc gag gag tea 288 
Leu Gly Ala Leu Gly He Arg Pro Pro Ala Pro Gly Arg Glu Glu Ser 
85 90 95 
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gtg gag gag ttt gtg cgc cgc aac etc ggt get gag gtc ttt gag cgc 336 
Val Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg 
100 105 110 

eta att gag cct ttc tgc tea ggt gte tat get ggc gat cct tee aag 384 
Leu lie Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys 
115 120 125 

cte agt atg aag get gea ttt ggg aag gtg tgg egg tta gaa gaa get 432 
Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Ala 
130 135 140 

gga ggt agt att att ggt gga acc ate aag acg att cag gag agg ggc 480 
Gly Gly Ser lie lie Gly Gly Thr He Lys Thr He Gin Glu Arg Gly 
145 150 155 160 

aag aat cca aaa cea eeg agg gat eee cgc ctt ccg aag cca aaa ggg 528 
Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly 
165 170 175 

cag aca gtt gea tct ttc agg aag ggt ctt gee atg ett cea aat gee 576 
Gin Thr Val Ala Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala 
180 185 190 

ate aca tec age ttg ggt agt aaa gte aaa eta tea tgg aaa cte acg 624 
He Thr Ser Ser Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr 
195 200 205 

age atg aca aaa tea gat gge aag ggg tat gtt ttg gag tat gaa aca 672 
Ser Met Thr Lys Ser Asp Gly Lys Gly Tyr Val Leu Glu Tyr Glu Thr 
210 215 220 

cca gaa ggg gtt gtt ttg gtg cag get aaa agt gtt ate atg acc att 720 
Pro Glu Gly Val Val Leu Val Gin Ala Lys Ser Val He Met Thr He 
225 230 235 240 

cca tea tat gtt get age gae att ttg cgt cea ett tea ggt gat get 768 
Pro Ser Tyr Val Ala Ser Asp He Leu Arg Pro Leu Ser Gly Asp Ala 
245 250 255 

gea gat gtt eta tea aga ttc tat tat cca cca gtt get get gta acg 816 
Ala Asp Val Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 
260 265 270 

gtt teg tat cea aag gaa gea att aga aaa gaa tgc tta att gat ggg 864 
Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
275 280 285 

gaa etc cag ggt ttt ggc cag ttg cat cca cgt agt caa gga gtt gag 912 
Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 
290 295 300 

aca tta gga aca ata tac age tea tea etc ttt cca aat cgt get cct 960 
Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 
305 310 315 320 

get ggt agg gtg tta ctt eta aac tac ata gga ggt get aca aac aca 1008 
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Ala Gly Arg Val Leu Leu Leu Asn Tyr lie Gly Gly Ala Thr Asn Thr 
325 330 335 

gga att gtt tec aag act gaa agt gag ctg gta gaa gca gtt gac cgt 1056 
Gly lie Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 
340 345 350 

gac etc cga aaa atg ett ata aat cct aea gca gtg gac cct tta gtc 1104 
Asp Leu Arg Lys Met Leu lie Asn Pro Thr Ala Val Asp Pro Leu Val 
355 360 365 

ett ggt gtc cga gtt tgg cca caa gcc ata cct cag ttc ctg gta gga 1152 
Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu Val Gly 
370 375 ^ 380 

cat ett gat ett ctg gag gee gca aaa tct gcc ctg gac caa ggt ggc 1200 
His Leu Asp Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Gin Gly Gly 
385 390 395 400 

tat aat ggg ctg ttc eta gga ggg aac tat gtt gca gga gtt gcc ctg 1248 
Tyr Asn Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu 
405 410 415 

ggc aga tgc att gag ggc gca tat gag agt gee gcg caa ata tat gac 1296 
Gly Arg Cys lie Glu Gly Ala Tyr Glu Ser Ala Ala Gin lie Tyr Asp 
420 425 430 

ttc ttg acc aag tac gcc tac aag tgatggaaga agtggagcge tgcttgttaa 1350 
Phe Leu Thr Lys Tyr Ala Tyr Lys 
435 440 

ttgttatgtt gcatagatga ggtgagaeca ggagtagtaa aaggegtcac gagtattttt 1410 

eattcttatt ttgtaaattg cacttctgtt tttttttcct gtcagtaatt agttagattt 1470 

tagttatgta ggagattgtt gtgttcactg ccctacaaaa gaatttttat tttgcattcg 153 0 

tttatgagag ctgtgcagac ttatgtaacg ttttactgta agtatcaaca aaatcaaata 1590 



<210> 24 
<211> 440 
<212> PRT 

<213> Sorghum bicolor 
<400> 24 

Ser Thr Val Glu Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro 
15 10 15 

Asn Ser Phe Gin Pro Ser Asp Pro Val Leu Ser Met Ala Val Asp Ser 
20 25 30 

Gly Leu Lys Asp Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe 
35 40 45 

Val Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp 
50 55 60 
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Leu Pro Phe Phe Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly 
65 70 75 80 

Leu Gly Ala Leu Gly lie Arg Pro Pro Ala Pro Gly Arg Glu Glu Ser 
85 90 95 

Val Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg 
100 105 110 

Leu lie Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys 
115 120 125 

Leu Ser Met Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Ala 
130 135 140 

Gly Gly Ser He lie Gly Gly Thr He Lys Thr. He Gin Glu Arg Gly 
145 150 155 160 

Lys Asn Pro Lys Pro Pro Arg Asp Pro Arg Leu Pro Lys Pro Lys Gly 
165 170 175 

Gin Thr Val Ala Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala 
180 185 190 

He Thr Ser Ser Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr 
195 200 205 

Ser Met Thr Lys Ser Asp Gly Lys Gly Tyr Val Leu Glu Tyr Glu Thr 
210 215 220 

Pro Glu Gly Val Val Leu Val Gin Ala Lys Ser Val He Met Thr He 
225 230 235 240 

Pro Ser Tyr Val Ala Ser Asp He Leu Arg Pro Leu Ser Gly Asp Ala 
245 250 255 

Ala Asp Val Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr 
260 265 270 

Val Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly 
275 280 285 

Glu Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu 
290 295 300 

Thr Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro 
305 310 315 320 

Ala Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr 
325 330 335 

Gly He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg 
340 345 350 



Asp Leu Arg Lys Met Leu He Asn Pro Thr Ala Val Asp Pro Leu Val 
355 360 365 
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Leu Gly Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu Val Gly 
370 375 380 

His Leu Asp Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Gin Gly Gly 
385 390 395 400 

Tyr Asn Gly Leu Phe Leu Gly Gly. Asn Tyr Val Ala Gly Val Ala Leu 

405 410 415 

Gly Arg Cys He Glu Gly Ala Tyr Glu Ser Ala Ala Gin He Tyr Asp 
420 425 430 



Phe Leu Thr Lys Tyr Ala Tyr Lys 
435 440 



<210> 25 
<211> 93 
<212> DNA 
<213> Zea mays 

<220> 

<221> misc_f eature 
<222> (1) . . (93) 

<223> maize protox-1 intron sequence 
<400> 25 

gtacgctcct cgctggcgcc gcagcgtctt cttctcagac tcatgcgcag ccatggaatt 60 
gagatgctga atggatttta tacgcgcgcg cag 93 



<210> 26 
<211> 2606 
<212> DNA 

<213> Beta vulgaris 
<220> 

<221> inisc_feature 
<222> (2601) . . (2606) 
<223> Sail site 

<220> 

<221> misc^feature 
<222> (1)..(538) 

<223> partial c-DNA of sugar beet protox-1 

<220> 

<221> mis cofeature 
<222> (539) . , (2606) 

<223> sugar beet protox-1 promoter region (partial 

sequence of the -3kb Pstl-Sall fragment subcloned 
from pWDC-20) 

<400> 26 

ctgcaggggg agggaaagag agaccgcgac ggtgagggag gggagaccgc gacggtgagg 60 
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gaggggagaa cgcgacggtg agggagggga gaacgcgatg gtgagggagg ggagaacgcg 120 
acgcgcaggg gagggggata actcgacggt gcagggaggt gagggggacg acgtgacggc 180 
gcaggggagg ggggaaccgt cgcgggaagg ggaagaccgg ggggccgaca aggtggtgtt 240 
actggggtag ggagaggcgg cgtggagaat agtaacagag ggaggagtgg tggtgctagg 300 
gtggaagaag ggtaagaaag aggaagaaag agaattaaca ttatcttaac caaacaccac 360 
tctaaatcta agggttttct tttcctttcc tctcctctcc ctttcttgat tccattccct 420 
ttaccccgtt gcaaccaaac gcccccttat tatggaccgg aggaagtatg tagagatggt 480 
cacaaaacta cttaagctgg taacttataa atatactggg tattaaatga attaagtggc 540 
cacaaaatga ctataaatta cttcgtaatc tttaggaact atgttggtca cgaaataaca 600 
taaaactggt tatttaatgg ctttatgtag gtactgcatt cataaatata tttctaacat 660 
aatcgtggta tgtaggtgtt ttataacaca aggattaggt ttacaccaat gtcattttca 720 
ttagaatgta gttagaatca ctttggaact ttgaagagtg atgacacatt tttattatgc 780 
ttttatgaaa tgtctttgtg gtttttatga tagtattgag tttaaggcaa gttggaagta 840 
tatgatggag aagtacagta tataggtgac aattggtttg cttgtttcta tgagttgaaa 900 
gataagtagt acacgacact gagcaatgac ctcttcttag ttgtaatttt gtcttctcga 960 
cgtagtgaaa gtacaaacaa gattatggct ttcaagcttc caagataacg agattgtatg 1020 
aattttgtgg tgtatttcac atcattgttt tacgttggag acaaactaaa accaatgatg 1080 
agtttgtgga ttcgagattt gcccctaagt cttatttacc catggcaagc atgctgaaac 1140 
atgttagtca aacttacaca gctacaatgt ttagggattt tgagcaaaaa atttgggtat 1200 
tctttgggta ccattatgtg agttgttgac tatggattaa acaaaatcac tatataaagt 1260 
ctggaatgag aagcatccgc aattgacaca ccatgttact ttgattgttt caacaagttt 1320 
attagatgta tttgtaggaa ttttgaagag gcggagatgt tgtgttataa ttgctttggg 1380 
ggtgcttcac atgcactctg ttagtgagac atcttcagct tatattttaa ggcggttagt 1440 
gagtatgatt tttttttttc aaacttttcg atttccatgt aattaaaaaa ggtgtttgat 1500 
aaatacatgt taagatagcc aagaaaaggc aactttcaaa caaataaaaa aaattaagtc 1560 
gcttaatcat ttttccaagt actttttact tttaacacca cttattactg aatctatagc 1620 
cgttaagaat gcattttcac gctcatacat gcaaatcaag aacctcctca ttgaaggaga 1680 
taatttagtc ctcataaacc ccgttaaaga catttttagc atccagagaa atttcgattc 1740 
agttaaaatt gcatatataa ccagagaaac aaattcagat gttagtcagt ccagctacat 1800 
aggtcaatgc ctgagagttt aaaagaatcc gtatccttaa gcataagtag gtattgaggt 1860 
gagttacaaa ggtaagttac cggttacgca ccacctccac caaacaagta tggttagaag 1920 
atacatgtaa tcgtttattt agagtactat ttataaaaaa ctttttaact agaaacagtt 1980 
gtttcatttt gatataaggt taattagaat tcccgagcaa gcaagaaggg gatatagagg 2040 
ataaggaggg cgagagagcg agagagagat gaaatcaatg gcgttatcaa actgcattcc 2100 
acagacacag tgcatgccat tgcacagcag cgggcattac aggggcaatt gtatcatgtt 2160 
gtcaattcca tgtagtttaa ttggaagacg aggttattat tcacataaga agaggaggat 2220 
gagcatgagt tgcagcacaa gctcaggctc aaagtcagcg gttaaagaag caggatcagg 2280 
atcaggatca ggagcaggag gattgctaga ctgcgtaatc gttggaggtg gaattagcgg 2340 
gctttgcatc gcgcaggctc tttgtacaaa acagtcctct ttatccccaa attttatagt 2400 
gacagaggcc aaagacagag ttggcggcaa catcgtcact gtggaggccg atggctatat 2460 
ctgggaggag ggacccaata gcttccagcc ttccgacgcg gtgctcacca tggcggtaat 2520 
tctgtctctt cattattcat aatcataatt caattcaatt caattcctaa cgtggaatgt 2580 
ggaatgtggc atgtgcgtag gtcgac 2606 



<210> 27 
<211> 31 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: Pclp_Pla - 

plastid clpP gene promoter top strand PGR primer 

<220> 

<221> misc_feature 
<222> (4) . . (9) 
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<223> EcoRI restriction site 

<400> 27 

gcggaattca tacttattta tcattagaaa g 



<210> 28 
<211> 32 
<212> DNA 
<213> Artificial 



Sequence 



<220> 

<223> Description of Artificial Sequence: Pclp_Plb - 
plastid clpP gene promoter bottom strand PGR 
primer 



<220> 

<221> inisc_f eature 

<222> (4) . . (9) 

<223> Xbal restriction site 



<400> 28 

gcgtctagaa agaactaaat actatatttc ac 



<210> 29 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Pclp_P2b - 
plastid clpP gene promoter bottom strand PGR 
primer 

<220> 

<221> misc_feature 

<222> (4) . . (9) 

<223> Ncol restriction site 

<400> 29 

gcgccatggt aaatgaaaga aagaactaaa 



<210> 30 
<211> 30 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: Trpsl6_Pla 
plastid rpsie gene 3' untranslated region 
Xbal/Hindlll top strand PGR primer 

<220> 

<221> misc_feature 
<222> (4) . . (9) 
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<223> Xbal restriction site 
<400> 30 

gcgtctagat caaccgaaat tcaattaagg 

<210> 31 
<211> 27 
<212> DMA 

<213> Artificial Sequence 



30 



<220> 

<223> Description of Artificial Sequence: Trpsl6_plb - 
plastid rpsl6 gene 3' untranslated region 
Xbal/Hindlll bottom strand PGR 
primer 

<220> 

<221> misc_feature 
<222> (4).. (9) 

<223> Hindlll restriction site 
<400> 31 

cgcaagcttc aatggaagca atgataa 27 

<210> 32 
<211> 36 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: minpsb_U - 
plastid psbA gene 5' untranslated region 38 nt 
(blxint/NcoI) including ATG start codon, top strand 
primer 

<400> 32 

gggagtccct gatgattaaa taaaccaaga ttttac 36 

<210> 33 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: minpsb_L - 
plastid psbA gene 5' untranslated region 38 nt 
(blunt /Ncol) including ATG start codon (bottom 
strand primer) 

<400> 33 

catggtaaaa tcttggttta tttaatcatc agggactccc 40 
<210> 34 
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<211> 32 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: APRTXPla - top 
strand PGR primer for amplifying the 5' portion of 
the mutant Arabidopsis protox gene 

<220> 

<221> misc_feature 
<222> (5) . . (10) 

<223> Ncol restriction site/ATG start codon 
<400> 34 

gggaccatgg attgtgtgat tgtcggcgga gg 32 



<210> 35 
<211> 24 
<212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: APRTXPlb - 
bottom strand PGR primer for amplifying the 5' 
portion of the mutant Arabidopsis protox gene 

<400> 35 

ctccgctctc cagcttagtg atac 24 



<210> 36 
<211> 633 
<212> DNA 
<213> sugar cane 

<220> 

<221> CDS 

•<222> (3) . . (305) 

<223> sugar cane protox~l partial coding sequence 
<400> 36 

tt tec aag act gaa agt gag ctg gta gaa gca gtt gac cgt gac etc 47 
Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu 
15 10 15 

egg aaa atg ctt ata aat cct aca gca gtg gac cct tta gtc ctt ggt 95 
Arg Lys Met Leu lie Asn Pro Thr Ala Val Asp Pro Leu Val Leu Gly 
20 25 30 

gtc cga gtt tgg cca caa gcc ata cct cag ttc ctg gta gga cat ctt 143 
Val Arg Val Trp Pro Gin Ala lie Pro Gin Phe Leu Val Gly His Leu 
35 40 45 

gat ctt ctg gag gcc gca aaa tct gcc ctg gac cga ggt ggc tac gat 191 
Asp Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Arg Gly Gly Tyr Asp 
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50 55 60 

ggg ctg ttc eta gga ggg aac tat gtt gca gga gtt gcc eta ggc aga 239 
Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg 
65 70 75 

tgc gtt gag ggc gcg tat gag agt gcc teg caa ata tat gac ttc ttg 287 
Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Tyr Asp Phe Leu 
80 85 90 95 

acc aag tat gcc tac aag tgatgaaaga agtggagtgc tgcttgttaa 335 
Thr Lys Tyr Ala Tyr Lys 
100 

ttgttatgtt gcatagatga ggtgagacca ggagtagtaa aagcgttacg agtatttttc 395 

attcttattt tgtaaattgc acttctggtt ttttcctgtc agtaattagt tagattttag 455 

ttctgtagga gattgttctg ttcactgccc tacaaaagaa tttttatttt gcattcgttt 515 

atgagagctg tgcagactta tgtagcgttt ttctgtaagt accaacaaaa tcaaatacta 575 

ttctgtaaga gctaacagaa tgtgcaactg agattgcctt ggatgaaaaa aaaaaaaa 633 

<210> 37 
<211> 101 
<212> PRT 
<213> sugar cane 

<400> 37 

Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg 
15 10 15 

Lys Met Leu He Asn Pro Thr Ala Val Asp Pro Leu Val Leu Gly Val 
20 25 30 

Arg Val Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp 
35 40 45 

Leu Leu Glu Ala Ala Lys Ser Ala Leu Asp Arg Gly Gly Tyr Asp Gly 
50 55 60 

Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys 
65 70 75 80 

Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Tyr Asp Phe Leu Thr 
85 90 95 

Lys Tyr Ala Tyr Lys 
100 



<210> 38 
<211> 4 
<212> PRT 
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<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: sub-sequence 1 

<400> 38 
Ala Pro Xaa Phe 
1 



<210> 39 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 8 
<400> 39 

Tyr He Gly Gly Xaa 
1 5 



<210> 40 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 
12 

<400> 40 
He Gly Gly Xaa 
1 



<210> 41 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: sub-sequence 
13 

<400> 41 

Ser Trp Xaa Leu Xaa 
1 5 



<210> 42 
<211> 5 
<212> PRT 

<213> Artificial Sequence 
<220> 



55 



wo 01/68826 



PCT/EPOl/02753 



<223> Description o£ Artificial Sequence: sub-sequence 
15 

<400> 42 

Gly Xaa Xaa Gly Leu 
1 5 



<2ia> 43 
<211> 4 
<212> PRT 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: s\ib-sequence 
17 

<400> 43 
Tyr Val Xaa Gly 
1 



<210> 44 
<211>- 1503 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1) . . (1503) 

<223> coding sequence for a mature maize protox-1 protein 
<400> 44 

get get gtg gcg ggc 
Ala Ala Val Ala Gly 
1 5 

egg ctg tee gcg gac 
Arg Leu Ser Ala Asp 
20 

tgc ace gcg cag gcg 
Cys Thr Ala Gin Ala 
35 . 

gtc acg gag gee cgc 
Val Thr Glu Ala Arg 
50 

cgc ccc gag gaa ggg 
Arg Pro Glu Glu Gly 

65 

ccc tec gac ccc gtt 
Pro Ser Asp Pro Val 
85 



ggc gcg gee gag gca ceg gea tee ace ggc gcg 48 
Gly Ala Ala Glu Ala Pro Ala Ser Thr Gly Ala 
10 15 

tgc gtc gtg gtg ggc gga ggc ate agt ggc etc 96 
Cys Val Val Val Gly Gly Gly He Ser Gly Leu 
25 30 

ctg gee acg egg cac ggc gtc ggg gac gtg ctt 144 
Leu Ala Thr Arg His Gly Val Gly Asp Val Leu 
40 45 

gee cgc ccc ggc ggc aac att ace ace gtc gag 192 
Ala Arg Pro Gly Gly Asn He Thr Thr Val Glu 
55 60 

tac etc tgg gag gag ggt ccc aac age ttc cag 240 
Tyr Leu Trp Glu Glu Gly Pro Asn Ser Phe Gin 
70 75 80 

etc ace atg gee gtg gac age gga ctg aag gat 288 
Leu Thr Met Ala Val Asp Ser Gly Leu Lys Asp 
90 95 
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gac ttg gtt ttt ggg gac cca aac gcg ccg cgt ttc gtg ctg tgg gag 336 
Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val Leu Trp Glu 
100 105 110 

ggg aag ctg agg ccc gtg cca tec aag ccc gcc gac etc ccg ttc ttc 384 
Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu Pro Phe Phe 
115 120 125 

gat etc atg age ate cca ggg aag etc agg gcc ggt eta ggc gcg ett 432 
Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly Leu Gly Ala Leu 
130 135 140 

ggc ate cgc ccg ect cet cca ggc cgc gaa gag tea gtg gag gag ttc 480 
Gly He Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val Glu Glu Phe 
145 150 155 160 

gtg cgc cgc aac etc ggt get gag gtc ttt gag cgc etc att gag cet 528 
Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu He Glu Pro 
165 170 175 

ttc tgc tea ggt gtc tat get ggt gat cet tet aag etc age atg aag 576 
Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu Ser Met Lys 
180 185 190 

get gca ttt ggg aag gtt tgg egg ttg gaa gaa act gga ggt agt att 624 
Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly Gly Ser He 
195 200 205 

att ggt gga ace ate aag aca att cag gag agg age aag aat cca aaa 672 
He Gly Gly Thr He Lys Thr He Gin Glu Arg Ser Lys Asn Pro Lys 
210 215 220 

cca ccg agg gat gcc cgc ett ccg aag cca aaa ggg cag aca gtt gca 720 
Pro Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin Thr Val Ala 
225 230 235 240 

tet ttc agg aag ggt ett gcc atg ett cca aat gcc att aca tec age 768 
Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala He Thr Ser Ser 
245 250 255 

ttg ggt agt aaa gtc aaa eta tea tgg aaa etc aeg age att aca aaa 816 
Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu Thr Ser He Thr Lys 
260 265 270 

tea gat gac aag gga tat gtt ttg gag tat gaa aeg cca gaa ggg gtt 864 
Ser Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro Glu Gly Val 
275 280 285 

gtt teg gtg cag get aaa agt gtt ate atg act att cca tea tat gtt 912 
Val Ser Val Gin Ala Lys Ser Val He Met Thr He Pro Ser Tyr Val 
290 295 • 300 

get age aac att ttg cgt cca ett tea age gat get gca gat get eta 960 
Ala Ser Asn He Leu Arg Pro Leu Ser Ser Asp Ala Ala Asp Ala Leu 
305 310 315 320 

tea aga ttc tat tat cca ccg gtt get get gta act gtt teg tat cca 1008 
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Ser Arg Phe Tyr Tyx Pro Pro Val Ala Ala Val Thr Val Ser Tyr Pro 
325 330 335 

aag gaa gca att aga aaa gaa tgc tta att gat ggg gaa etc cag ggc 1056 

Lys Glu Ala lie Arg Lys Glu Cys Leu lie Asp Gly Glu Leu Gin Gly 
340 345 350 

ttt ggc cag ttg cat cca cgt agt caa gga gtt gag aca tta gga aca 1104 

Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr Leu Gly Thr 
355 360 365 

ata tac agt tec tea etc ttt cca aat cgt get cct gac ggt agg gtg 1152 

lie Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp Gly Arg Val 
370 375 380 

tta ctt eta aac tac ata gga ggt get aca aae aca gga att gtt tec 1200 

Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly He Val Ser 

385 390 395 400 

aag act gaa agt gag ctg gtc gaa gca gtt gac cgt gac etc cga aaa 1248 

Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp Leu Arg Lys 
405 410 415 

atg ctt ata aat tct aca gca gtg gac cct tta gtc ctt ggt gtt cga 1296 

Met Leu He Asn Ser Thr Ala Val Asp Pro Leu Val Leu Gly Val Arg 
420 425 430 

gtt tgg cca caa gcc ata cct cag ttc ctg gta gga cat ctt gat ctt 1344 

Val Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His Leu Asp Leu 
435 440 445 

ctg gaa gcc gca aaa get gcc ctg gac cga ggt ggc tac gat ggg ctg 1392 

Leu Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr Asp Gly Leu 
450 455 460 

ttc eta gga ggg aac tat gtt gca gga gtt gcc ctg ggc aga tgc gtt 1440 

Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly Arg Cys Val 

465 470 475 480 

gag ggc gcg tat gaa agt gcc teg caa ata tct gac ttc ttg acc aag 1488 

Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Ser Asp Phe Leu Thr Lys 
485 490 495 



tat gcc tac aag tga 
Tyr Ala Tyr Lys 
500 



1503 



<210> 45 
<211> 500 
<212> PRT 
<213> Zea mays 

<400> 45 

Ala Ala Val Ala Gly Gly Ala Ala Glu Ala Pro Ala Ser Thr Gly Ala 

1 5 10 . 15 

Arg Leu Ser Ala Asp Cys Val Val Val Gly Gly Gly He Ser Gly Leu 
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20 



Cys 


Thr 


Ala 


Gin 


Ala 


Leu 


Ala 


Thr 






35 










40 


Val 


Thr 


Glu 


Ala 


Arg 


Ala 


Arg 


Pro 




50 










55 




Arg 


Pro 


Glu 


Glu 


Gly 


Tyr 


Leu 


Trp 


65 










70 






Pro 


Ser 


Asp 


Pro 


Val 


Leu 


Thr 


Met 










85 








Asp 


Leu 


Val 


Phe 


Gly 


Asp 


Pro 


Asn 








100 










Gly 


Lys 


Leu 


Arg 


Pro 


Val 


Pro 


Ser 






115 










120 


Asp 


Leu 


Met 


Ser 


lie 


Pro 


Gly 


Lys 




130 










135 




Gly 


lie 


Arg 


Pro 


Pro 


Pro 


Pro 


Gly 


145 










150 






Val 


Arg 


Arg 


Asn 


Leu 


Gly 


Ala 


Glu 










165 








Phe 


Cys 


Ser 


Gly 


Val 


Tyr 


Ala 


Gly 








180 










Ala 


Ala 


Phe 


Gly 


Lys 


Val 


Trp 


Arg 






195 










200 


lie 


Gly 


Gly 


Thr 


lie 


Lys 


Thr 


He 




210 










215 




Pro 


Pro 


Arg 


Asp 


Ala 


Arg 


Leu 


Pro 


225 










230 






Ser 


Phe 


Arg 


Lys 


Gly 


Leu 


Ala 


Met 










245 








Leu 


Gly 


Ser 


Lys 


Val 


Lys 


Leu 


Ser 








260 










Ser 


Asp 


Asp 


Lys 


Gly 


Tyr 


Val 


Leu 






275 










280 


Val 


Ser 


Val 


Gin 


Ala 


Lys 


Ser 


Val 




290 










295 




Ala 


Ser 


Asn 


lie 


Leu 


Arg 


Pro 


Leu 


305 










310 






Ser 


Arg 


Phe 


Tyr 


Tyr 


Pro 


Pro 


Val 










325 








Lys 


Glu 


Ala 


lie 


Arg 


Lys 


Glu 


Cys 








340 










Phe 


Gly 


Gin 


Leu 


His 


Pro 


Arg 


Ser 






355 










360 


lie 


Tyr 


Ser 


Ser 


Ser 


Leu 


Phe 


Pro 




370 










375 




Leu 


Leu 


Leu 


Asn 


Tyr 


lie 


Gly 


Gly 


385 










390 






Lys 


Thr 


Glu 


Ser 


Glu 


Leu 


Val 


Glu 










405 








Met 


Leu 


lie 


Asn 


Ser 


Thr 


Ala 


Val 








420 










Val 


Trp 


Pro 


Gin 


Ala 


lie 


Pro 


Gin 






435 










440 


Leu 


Glu 


Ala 


Ala 


Lys 


Ala 


Ala 


Leu 




450 










455 




Phe 


Leu 


Gly 


Gly 


Asn 


Tyr 


Val 


Ala 


465 










470 







25 




30 






Arg 


His Gly Val 


Gly Asp 


Val 


Leu 






45 






Gly 


Gly Asn He 


Thr Thr 


Val 


Glu 




60 








Glu 


Glu Gly Pro 


Asn Ser 


Phe 


Gin 




75 






80 


Ala 


Val Asp Ser 


Gly Leu 


Lys 


Asp 




90 




95 




Ala 


Pro Arg Phe 


Val Leu 


Trp 


Glu 


105 




110 






Lys 


Pro Ala Asp 


Leu Pro 


Phe 


Phe 






125 






Leu 


Arg Ala Gly 


Leu Gly 


Ala 


Leu 




140 








Arg 


Glu Glu Ser 


Val Glu 


Glu 


Phe 




155 






160 


Val 


Phe Glu Arg 


Leu He 


Glu 


Pro 




170 




175 




Asp 


Pro Ser Lys 


Leu Ser 


Met 


Lys 


185 




190 






Leu 


Glu Glu Thr 


Gly Gly 


Ser 


He 






205 






Gin 


Glu Arg Ser 


Lys Asn 


Pro 


Lys 




220 








Lys 


Pro Lys Gly 


Gin Thr 


Val 


Ala 




235 






240 


Leu 


Pro Asn Ala 


He Thr 


Ser 


Ser 




250 




255 




Trp 


Lys Leu Thr 


Ser He 


Thr 


Lys 


265 




270 






Glu 


Tyr Glu Thr 


Pro Glu 


Gly 


Val 






285 






He 


Met Thr He 


Pro Ser 


Tyr 


Val 




300 








Ser 


Ser Asp Ala 


Ala Asp 


Ala 


Leu 




315 






320 


Ala 


Ala Val Thr 


Val Ser 


Tyr 


Pro 




330 




335 




Leu 


He Asp Gly 


Glu Leu 


Gin 


Gly 


345 




350 






Gin 


Gly Val Glu 


Thr Leu 


Gly 


Thr 






365 






Asn 


Arg Ala Pro 


Asp Gly 


Arg 


Val 




380 








Ala 


Thr Asn Thr 


Gly He 


Val 


Ser 




395 






400 


Ala 


Val Asp Arg 


Asp Leu 


Arg 


Lys 




410 




415 




Asp 


Pro Leu Val 


Leu Gly 


Val 


Arg 


425 




430 






Phe 


Leu Val Gly 


His Leu 


Asp 


Leu 






445 






Asp 


Arg Gly Gly 


Tyr Asp 


Gly 


Leu 




460 








Gly 


Val Ala Leu 


Gly Arg 


Cys 


Val 




475 






480 
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Glu Gly Ala Tyr Glu Ser Ala Ser Gin lie Ser Asp Phe Leu Thr Lys 
485 490 495 

Tyr Ala Tyr Lys 
500 



<210> 46 
<211> 1608 
<212> DNA 
<213> Zea mays 

<220> 

<221> CDS 

<222> (1) . . (1608) 

<223> predicted coding sequence for a maize protox-1 polypeptide precursor 

<400> 46 

atg gtc gcc gcc aca gcc acc gcc atg gcc acc get gca teg ccg eta 48 
Met Val Ala Ala Thr Ala Thr Ala Met Ala Thr Ala Ala Ser Pro Leu 
15 10 15 

etc aac ggg acc cga ata cct gcg egg etc cgc cat cga gga etc age 95 
Leu Asn Gly Thr Arg He Pro Ala Arg Leu Arg His Arg Gly Leu Ser 

20 25 .30 

gtg cgc tgc get get gtg geg gge ggc gcg gcc gag gca ccg gca tec 144 
Val Arg Cys Ala Ala Val Ala Gly Gly Ala Ala Glu Ala Pro Ala Ser 
35 40 45 

acc ggc gcg egg ctg tec geg gac tgc gtc gtg gtg ggc gga ggc ate 192 
Thr Gly Ala Arg Leu Ser Ala Asp Cys Val Val Val Gly Gly Gly He 
50 55 60 

agt ggc etc tgc acc geg cag gcg ctg gcc acg egg cac ggc gtc ggg 240 
Ser Gly Leu Cys Thr Ala Gin Ala Leu Ala Thr Arg His Gly Val Gly 
65 70 75 80 

gac gtg ett gtc acg gag gcc cgc gcc cgc cec gge ggc aac att acc 288 
Asp Val Leu Val Thr Glu Ala Arg Ala Arg Pro Gly Gly Asn He Thr 
85 90 95 

acc gtc gag cgc cec gag gaa ggg tac etc tgg gag gag ggt ccc aac 336 
Thr Val Glu Arg Pro Glu Glu Gly Tyr Leu Trp Glu Glu Gly Pro Asn 
100 105 110 

age ttc cag ccc tec gac ccc gtt etc ace atg gcc gtg gac age gga 384 
Ser Phe Gin Pro Ser Asp Pro Val Leu Thr Met Ala Val Asp Ser Gly 
115 120 125 

ctg aag gat gac ttg gtt ttt ggg gac cea aac gcg ccg cgt ttc gtg 432 
Leu Lys Asp Asp Leu Val Phe Gly Asp Pro Asn Ala Pro Arg Phe Val 
130 135 140 

ctg tgg gag ggg aag ctg agg ccc gtg cea tec aag ccc gcc gac etc 480 
Leu Trp Glu Gly Lys Leu Arg Pro Val Pro Ser Lys Pro Ala Asp Leu 
145 150 155 160 
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ccg ttc ttc gat etc atg age ate eea ggg aag etc agg gee ggt eta 528 
Pro Phe Phe Asp Leu Met Ser lie Pro Gly Lys Leu Arg Ala Gly Leu 
165 170 175 

gge gcg ctt ggc ate cgc ccg cct cct cca ggc cgc gaa gag tea gtg 576 
Gly Ala Leu Gly He Arg Pro Pro Pro Pro Gly Arg Glu Glu Ser Val 
180 185 190 

gag gag ttc gtg cgc cgc aac etc ggt get gag gtc ttt gag cgc etc 624 
Glu Glu Phe Val Arg Arg Asn Leu Gly Ala Glu Val Phe Glu Arg Leu 
195 200 205 

att gag cct ttc tgc tea ggt gtc tat get ggt gat cct tct aag etc 672 
He Glu Pro Phe Cys Ser Gly Val Tyr Ala Gly Asp Pro Ser Lys Leu 
210 215 220 

age atg aag get gea ttt ggg aag gtt tgg egg ttg gaa gaa act gga 720 
Ser Met Lys Ala Ala Phe Gly Lys Val Trp Arg Leu Glu Glu Thr Gly 
225 230 235 240 

ggt agt att. att ggt gga ace ate aag aca att cag gag agg age aag 768 
Gly Ser He He Gly Gly Thr He Lys Thr He Gin Glu Arg Ser Lys 
245 250 255 

aat cca aaa cca ccg agg gat gee cgc ctt ccg aag cca aaa ggg cag 816 
Asn Pro Lys Pro Pro Arg Asp Ala Arg Leu Pro Lys Pro Lys Gly Gin 

260 265 . 270 

aca gtt gea tct ttc agg aag ggt ctt gee atg ctt cca aat gee att 864 
Thr Val Ala Ser Phe Arg Lys Gly Leu Ala Met Leu Pro Asn Ala He 
275 280 285 

aca tec age ttg ggt agt aaa gtc aaa eta tea tgg aaa etc aeg age 912 
Thr Ser Ser Leu Gly Ser Lys Val Lys Leu Ser Trp Lys Leu. Thr Ser 
290 295 300 

att aca aaa tea gat gae aag gga tat gtt ttg gag tat gaa aeg cca 960 
He Thr Lys Ser Asp Asp Lys Gly Tyr Val Leu Glu Tyr Glu Thr Pro 
305 310 315 320 

gaa ggg gtt gtt teg gtg cag get aaa agt gtt ate atg act att cca 1008 
Glu Gly Val Val Ser Val Gin Ala Lys Ser Val He Met Thr He Pro 

325 330 335 

tea tat gtt get age aac att ttg egt cca ctt tea age gat get gea 1056 
Ser Tyr Val Ala Ser Asn He Leu Arg Pro Leu Ser Ser Asp Ala Ala 
340 345 350 

gat get eta tea aga ttc tat tat cca ccg gtt get get gta act gtt 1104 
Asp Ala Leu Ser Arg Phe Tyr Tyr Pro Pro Val Ala Ala Val Thr Val 
355 360 365 

teg tat cca aag gaa gea att aga aaa gaa tgc tta att gat ggg gaa 1152 
Ser Tyr Pro Lys Glu Ala He Arg Lys Glu Cys Leu He Asp Gly Glu 
370 375 380 

etc cag ggc ttt ggc cag ttg cat cca cgt agt caa gga gtt gag aca 1200 
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Leu Gin Gly Phe Gly Gin Leu His Pro Arg Ser Gin Gly Val Glu Thr 
385 390 395 400 

tta gga aca ata tac agt tec tea etc ttt cca aat cgt get cct gac 1248 
Leu Gly Thr He Tyr Ser Ser Ser Leu Phe Pro Asn Arg Ala Pro Asp 
405 410 415 

ggt agg gtg tta ctt eta aac tac ata gga ggt get aca aac aca gga 1296 
Gly Arg Val Leu Leu Leu Asn Tyr He Gly Gly Ala Thr Asn Thr Gly 
420 425 430 

att gtt tec aag act gaa agt gag ctg gtc gaa gea gtt gac cgt gac 1344 
He Val Ser Lys Thr Glu Ser Glu Leu Val Glu Ala Val Asp Arg Asp 
435 440 445 

etc cga aaa atg ctt ata aat tct aca gca gtg gac cct tta gtc ctt 1392 
Leu Arg Lys Met Leu He Asn Ser Thr Ala Val Asp Pro Leu Val Leu 
450 455 460 

ggt gtt cga gtt tgg cca caa gee ata cct cag ttc ctg gta gga cat 1440 
Gly Val Arg Val Trp Pro Gin Ala He Pro Gin Phe Leu Val Gly His 
465 470 475 480 

ctt gat ctt ctg gaa gee gca aaa get gee ctg gac cga ggt ggc tac 1488 
Leu Asp Leu Leu Glu Ala Ala Lys Ala Ala Leu Asp Arg Gly Gly Tyr 
485 490 495 

gat ggg ctg ttc eta gga ggg aac tat gtt gca gga gtt gee ctg ggc 1536 
Asp Gly Leu Phe Leu Gly Gly Asn Tyr Val Ala Gly Val Ala Leu Gly 
500 505 510 

aga tgc gtt gag ggc gcg tat gaa agt gee teg caa ata tct gac ttc 1584 
Arg Cys Val Glu Gly Ala Tyr Glu Ser Ala Ser Gin He Ser Asp Phe 
515 520 525 

ttg ace aag tat gee tac aag tga 1608 
Leu Thr Lys Tyr Ala Tyr Lys 
530 535 



<210> 47 
<211> 535 
<212> PRT 
<213> Zea mays 



<400> 47 



Met Val 


Ala 


Ala Thr 


Ala 


Thr 


Ala 


Met 


Ala 


Thr 


Ala 


Ala 


Ser 


Pro 


Leu 


1 




5 










10 










15 




Leu Asn 


Gly 


Thr Arg 


He 


Pro 


Ala 


Arg 


Leu 


Arg 


His 


Arg 


Gly 


Leu 


Ser 






20 








25 










30 






Val Arg 


Cys 


Ala Ala 


Val 


Ala 


Gly 


Gly 


Ala 


Ala 


Glu 


Ala 


Pro 


Ala 


Ser 




35 








40 










45 








Thr Gly 


Ala 


Arg Leu 


Ser 


Ala 


Asp 


Cys 


Val 


Val 


Val 


Gly 


Gly 


Gly 


He 


50 








55 










60 










Ser Gly 


Leu 


Cys Thr Ala 


Gin 


Ala 


Leu 


Ala 


Thr 


Arg 


His 


Gly 


Val 


Gly 


65 






70 










75 










80 


Asp Val 


Leu 


Val Thr 


Glu 


Ala 


Arg 


Ala 


Arg 


Pro 


Gly 


Gly 


Asn 


He 


Thr 



62 



wo 01/68826 



PCT/EPOl/02753 



Thr 


Val 


Glu 


Arg 








100 


Ser 


Phe 


Gin 


Pro 






115 




Leu 


Lys 


Asp 


Asp 




130 






Leu 


Trp 


Glu 


Gly 


145 








Pro 


Phe 


Phe 


Asp 


Gly 


Ala 


Leu 


Gly 








180 


Glu 


Glu 


Phe 


Val 






195 




lie 


Glu 


Pro 


Phe 




210 






Ser 


Met 


Lys 


Ala 


225 








Gly 


Ser 


He 


He 


Asn 


Pro 


Lys 


Pro 








260 


Thr 


Val 


Ala 


Ser 






275 




Thr 


Ser 


Ser 


Leu 




290 






He 


Thr 


Lys 


Ser 


305 








Glu 


Gly 


Val 


Val 


Ser 


Tyr 


Val 


Ala 








340 


Asp 


Ala 


Leu 


Ser 






355 




Ser 


Tyr 


Pro 


Lys 




370 






Leu 


Gin 


Gly 


Phe 


385 








Leu 


Gly 


Thr 


He 


Gly 


Arg 


Val 


Leu 








420 


He 


Val 


Ser 


Lys 






435 




Leu 


Arg 


Lys 


Met 




450 






Gly 


Val 


Arg 


Val 


465 








Leu 


Asp 


Leu 


Leu 


Asp 


Gly 


Leu 


Phe 








500 


Arg 


Cys 


Val 


Glu 






515 




Leu 


Thr 


Lys 


Tyr 




530 







Pro 


Glu 


Glu 


Gly 


Ser 


Asp 


Pro 


Val 








120 


Leu 


Val 


Phe 


Gly 






135 




Lys 


Leu 


Arg 


Pro 




150 






Leu 


Met 


Ser 


He 


165 








He 


Arg 


Pro 


Pro 


Arg 


Arg 


Asn 


Leu 








200 


Cys 


Ser 


Gly 


Val 






215 




Ala 


Phe 


Gly 


Lys 




230 






Gly 


Gly 


Thr 


He 


245 








Pro 


Arg 


Asp 


Ala 


Phe 


Arg 


Lys 


Gly 








280 


Gly 


Ser 


Lys 


Val 






295 




Asp 


Asp 


Lys 


Gly 




310 






Ser 


Val 


Gin 


Ala 


325 








Ser 


Asn 


He 


Leu 


Arg 


Phe 


Tyr 


Tyr 








360 


Glu 


Ala 


He 


Arg 






375 




Gly 


Gin 


Leu 


His 




390 






Tyr 


Ser 


Ser 


Ser 


405 








Leu 


Leu 


Asn 


Tyr 


Thr 


Glu 


Ser 


Glu 








440 


Leu 


He 


Asn 


Ser 






455 




Trp 


Pro 


Gin 


Ala 




470 






Glu 


Ala 


Ala 


Lys 


485 








Leu 


Gly 


Gly 


Asn 


Gly 


Ala 


Tyr 


Glu 








520 


Ala 


Tyr 


Lys 








535 







90 




Tyr 


Leu 


Trp Glu 


105 






Leu 


Thr 


Met Ala 


Asp 


Pro 


Asn Ala 






140 


Val 


Pro 


Ser Lys 






155 


Pro 


Gly 


Lys Leu 




170 




Pro 


Pro 


Gly Arg 


185 






Glv 


Ala 


Glu Val 


Tyr 


Ala 


Gly Asp 






220 


Val 


Trp 


Arg Leu 






235 


Lys 


Thr 


He Gin 




250 




Arg 


Leu 


Pro Lys 


265 






Leu 






Lys 


Leu 


Ser Trp 






300 


Tyr 


Val 


Leu Glu 






315 


Lys 


Ser 


Val He 




330 




Arg 


Pro 


Leu Ser 


345 






Pro 


Pro 


Val Ala 


Lys 


Glu 


Cys Leu 






380 


Pro 


Arg 


Ser Gin 






395 


Leu 


Phe 


Pro Asn 




410 




He 


Gly 


Gly Ala 


425 






Leu 


Val 


Glu Ala 


Thr 


Ala 


Val Asp 






460 


He 


Pro 


Gin Phe 






475 


Ala 


Ala 


Leu Asp 




490 




Tyr 


Val 


Ala Gly 


505 






Ser 


Ala 


Ser Gin 







95 




Glu 


Gly 


Pro 


Asn 




110 






Val 


Asp 


Ser 


Gly 


125 








Pro 


Aro 


Phe 


Val 


Pro 


Ala 


Asp 


Leu 








160 


Arg 


Ala 


Gly 


Leu 






175 




Glu 


Glu 


Ser 


Val 




190 






Phe 


Glu 


Arg 


Leu 


205 








Pro 


Ser 


Lys 


Leu 


Glu 


Glu 


Thr 


Gly 








240 


Glu 


Arg 


Ser 


Lys 






255 




Pro 


Lys 


Gly 


Gin 




270 






Pro 


Asn 


Ala 


He 


285 








Lys 


Leu 


Thr 


Ser 


Tyr 


Glu 


Thr 


Pro 








320 


Met 


Thr 


He 


Pro 






335 




Ser 


Asp 


Ala 


Ala 




350 






Ala 


Val 


Thr 


Val 


365 








He 


Asp 


Glv 


Glu 


Gly 


Val 


Glu 


Thr 








400 


Arg 


Ala 


Pro 


Asp 






415 




Thr 


Asn 


Thr 


Gly 




430 






Val 


Asp 


Arg 


Asp 


445 








Pro 


Leu 


Val 


Leu 


Leu 


Val 


Gly 


His 








480 


Arg 


Gly 


Gly 


Tyr 






495 




Val 


Ala 


Leu 


Gly 




510 






He 


Ser 


Asp 


Phe 



525 



63 



This Page is Inserted by IFW Indexing and Scanning 
Operations and is not part of the Official Record 

BEST AVAILABLE IMAGES 

Defective images within this document are accurate representations of the original 
documents submitted by the applicant. 

Defects in the images include but are not limited to the items checked: 

□ BLACK BORDERS 

□ IMAGE CUT OFF AT TOP, BOTTOM OR SffiES 

□ FADED TEXT OR DRAWING 

□ BLURRED OR ILLEGIBLE TEXT OR DRAWING 

□ SKEWED/SLANTED IMAGES 

□ COLOR OR BLACK AND WHITE PHOTOGRAPHS 

□ GRAY SCALE DOCUMENTS 

□ LINES OR MARKS ON ORIGINAL DOCUMENT 

□ REFERENCE(S) OR EXHIBIT(S) SUBMITTED ARE POOR QUALITY 

□ OTHER: 

IMAGES ARE BEST AVAILABLE COPY. 
As rescanning these documents will not correct the image 
problems checked, please do not report these problems to 
the IFW Image Problem Mailbox. 



